OM protein - protein search, using sw model

Run on: August 5, 2006, 00:05:14 ; Search time 318 Seconds 

(without alignments) 
2763.411 Million cell updates/sec 

Title: US-10-763-210-1 
Perfect score: 5164 

Sequence: 1 MLLLGILTLAFAGRTAGGFE...DQCNLHRKPQELDFCVLRPC 950

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2849598 seqs, 925015592 residues 

Total number of hits satisfying chosen parameters: 2849598 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_7.2:*

1: uniprot_sprot:*
2: uniprot_trembl:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
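
The throughput figure in the header can be cross-checked from the other header values. The sketch below assumes that one "cell update" is one (query residue, database residue) pair, so the total is the query length times the number of database residues searched; the function name is illustrative and not part of the search program.

```python
# Figures copied from the report header.
QUERY_LENGTH = 950          # residues in query US-10-763-210-1
DB_RESIDUES = 925_015_592   # "Searched: 2849598 seqs, 925015592 residues"
SEARCH_SECONDS = 318        # "Search time 318 Seconds"


def million_cell_updates_per_second(query_len, db_residues, seconds):
    """Assumed definition: every query/database residue pair is one cell."""
    cells = query_len * db_residues
    return cells / seconds / 1e6


rate = million_cell_updates_per_second(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
print(f"{rate:.3f} Million cell updates/sec")
```

Under that assumption the result agrees with the 2763.411 Million cell updates/sec printed in the header.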

SUMMARIES

Result          % Query
   No.   Score    Match  Length  DB  ID           Description
     1    5156     99.8     950   1  ATS15_HUMAN

ALIGNMENTS

RESULT 1 
ATS15_HUMAN 



ID ATS15_HUMAN STANDARD; PRT; 950 AA.
AC Q8TE58;
DT 28-FEB-2003, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2002, sequence version 1.
DT 07-MAR-2006, entry version 29.
DE ADAMTS-15 precursor (EC 3.4.24.-) (A disintegrin and metalloproteinase
DE with thrombospondin motifs 15) (ADAM-TS 15) (ADAM-TS15).
GN Name=ADAMTS15;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RX MEDLINE=21856482; PubMed=11867212; DOI=10.1016/S0378-1119(01)00861-7;
RA Cal S., Obaya A.J., Llamazares M., Garabaya Quesada V.,
RA Lopez-Otin C.;
RT "Cloning, expression analysis, and structural characterization of
RT seven novel human ADAMTSs, a family of metalloproteinases with
RT disintegrin and thrombospondin-1 domains.";

RL Gene 283:49-62(2002).

CC -!- COFACTOR: Binds 1 zinc ion per subunit (By similarity).
CC -!- SUBCELLULAR LOCATION: Secreted protein; extracellular space;
CC extracellular matrix (By similarity).
CC -!- TISSUE SPECIFICITY: Expressed in fetal liver and kidney, but not
CC in any of the adult tissues examined.
CC -!- DOMAIN: The spacer domain and the TSP type-1 domains are important
CC for a tight interaction with the extracellular matrix (By
CC similarity).
CC -!- PTM: The precursor is cleaved by a furin endopeptidase (By
CC similarity).
CC -!- SIMILARITY: Contains 1 disintegrin domain.
CC -!- SIMILARITY: Contains 1 peptidase M12B domain.
CC -!- SIMILARITY: Contains 3 TSP type-1 domains.

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AJ315733; CAC86014.1; -; mRNA.
DR HSSP; P07996; 1LSL.
DR MEROPS; M12.025; -.
DR Ensembl; ENSG00000166106; Homo sapiens.
DR HGNC; HGNC:16305; ADAMTS15.
DR MIM; 607509; gene.
DR InterPro; IPR006586; ADAM_cysteine.
DR InterPro; IPR010294; ADAM_spacer1.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR013277; ADAM_TS8.
DR InterPro; IPR001762; Disintegrin.
DR InterPro; IPR001818; Pept_M10A_M12B.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR002870; Peptidase_M12B_N.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF01562; Pep_M12B_propep; 1.
DR Pfam; PF01421; Reprolysin; 1.
DR Pfam; PF00090; TSP_1; 3.
DR PRINTS; PR01861; ADAMTS8.
DR PRINTS; PR01857; ADAMTSFAMILY.
DR PRINTS; PR01705; TSPIREPEAT.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO;
DR PROSITE; PS00546; CYSTEINE_SWITCH; FALSE_NEG.
DR PROSITE; PS00427; DISINTEGRIN_1; FALSE_NEG.
DR PROSITE; PS50214; DISINTEGRIN_2; FALSE_NEG.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; 1.
KW Extracellular matrix; Glycoprotein; Hydrolase; Metal-binding;
KW Metalloprotease; Protease; Repeat; Signal; Zinc; Zymogen.



FT SIGNAL                   Potential.
FT PROPEP                   By similarity.
FT                          /FTId=PRO_0000029192.
FT CHAIN                    ADAMTS-15.
FT                          /FTId=PRO_0000029193.
FT DOMAIN                   Peptidase M12B.
FT DOMAIN      428    515   Disintegrin.
FT DOMAIN      516    571   TSP type-1 1.
FT DOMAIN      839          TSP type-1 2.
FT DOMAIN      896    949   TSP type-1 3.
FT REGION             838   Spacer.
FT COMPBIAS           700   Cys-rich.
FT ACT_SITE    362    362   By similarity.
FT METAL       361    361   Zinc (catalytic) (By similarity).
FT METAL       365    365   Zinc (catalytic) (By similarity).
FT METAL       371    371   Zinc (catalytic) (By similarity).
FT SITE        174    174   Cysteine switch (Potential).
FT CARBOHYD                 N-linked (GlcNAc...) (Potential).
FT CARBOHYD                 N-linked (GlcNAc...) (Potential).
FT CARBOHYD                 N-linked (GlcNAc...) (Potential).
FT CARBOHYD    679    679   N-linked (GlcNAc...) (Potential).
FT DISULFID    339    422   By similarity.
FT DISULFID    377    406   By similarity.
FT DISULFID    528    565   By similarity.
FT DISULFID    532    570   By similarity.
FT DISULFID    543    555   By similarity.
SQ SEQUENCE 950 AA; 103287 MW; 5DFBE18285CCCC3B CRC64;



Query Match 99.8%; Score 5156; DB 1; Length 950;
Best Local Similarity 99.9%; Pred. No. 0;
Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy   1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950





RESULT 2 
Q32MI6_HUMAN 

ID Q32MI6_HUMAN PRELIMINARY; PRT; 950 AA.
AC Q32MI6;
DT 06-DEC-2005, integrated into UniProtKB/TrEMBL.
DT 06-DEC-2005, sequence version 1.
DT 01-FEB-2006, entry version 3.
DE A disintegrin-like and metalloprotease (Reprolysin type) with
DE thrombospondin type 1 motif, 15, preproprotein.
GN Name=ADAMTS15;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=PCR rescued clones;
RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=PCR rescued clones;
RG NIH MGC Project;
RL Submitted (NOV-2005) to the EMBL/GenBank/DDBJ databases.
CC
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR EMBL; BC109114; AAI09115.1; -; mRNA.
DR GO; GO:0031012; C:extracellular matrix; IEA.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.
DR GO; GO:0007229; P:integrin-mediated signaling pathway; IEA.
DR GO; GO:0006508; P:proteolysis; IEA.
KW Integrin; Metalloprotease; Protease.
SQ SEQUENCE 950 AA; 103287 MW; 5DFBE18285CCCC3B CRC64;

Query Match 99.8%; Score 5156; DB 2; Length 950; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
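
The 8-point difference between the perfect score (5164) and this hit's score (5156) is consistent with the single mismatch at query position 19, where the query has F and the database sequence has S. A minimal sketch of that arithmetic follows; the two BLOSUM62 entries are the standard matrix values, and the helper name is an illustrative assumption.

```python
# Standard BLOSUM62 entries needed for this one substitution.
BLOSUM62_F_F = 6    # F aligned with F (identity)
BLOSUM62_F_S = -2   # F aligned with S (the single mismatch)

PERFECT_SCORE = 5164  # query aligned with itself, from the report header


def score_after_substitution(perfect, identity_score, substitution_score):
    """Swap one identity column for one substitution column, no gaps."""
    return perfect - identity_score + substitution_score


score = score_after_substitution(PERFECT_SCORE, BLOSUM62_F_F, BLOSUM62_F_S)
print(score)
```

With no indels in the alignment, gap penalties do not enter the calculation.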



Qy   1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950



RESULT 3 
Q504Z2_MOUSE 

ID Q504Z2_MOUSE PRELIMINARY; PRT; 950 AA.
AC Q504Z2;
DT 07-JUN-2005, integrated into UniProtKB/TrEMBL.
DT 07-JUN-2005, sequence version 1.
DT 07-FEB-2006, entry version 5.
DE A disintegrin-like and metalloprotease (Reprolysin type) with
DE thrombospondin type 1 motif, 15.
GN Name=Adamts15;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC Muroidea; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=C57BL/6; TISSUE=Brain;
RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=C57BL/6; TISSUE=Brain;
RG NIH MGC Project;
RL Submitted (MAY-2005) to the EMBL/GenBank/DDBJ databases.
CC
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR EMBL; BC094677; AAH94677.1; -; mRNA.
DR Ensembl; ENSMUSG00000033453; Mus musculus.
DR MGI; MGI:2449569; Adamts15.
DR GO; GO:0031012; C:extracellular matrix; IEA.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.
DR GO; GO:0007229; P:integrin-mediated signaling pathway; IEA.
DR GO; GO:0006508; P:proteolysis; IEA.
DR InterPro; IPR006586; ADAM_cysteine.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR013277; ADAM_TS8.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR002332; PII_GlnB_UMP_S.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF01562; Pep_M12B_propep; 1.
DR Pfam; PF01421; Reprolysin; 1.
DR Pfam; PF00090; TSP_1; 3.
DR PRINTS; PR01705; TSPIREPEAT.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO; 1.
DR PROSITE; PS00496; PII_GLNB_UMP; UNKNOWN_1.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.
KW Integrin; Metalloprotease; Protease.
SQ SEQUENCE 950 AA; 103938 MW; 891926F281E92010 CRC64;

Query Match 92.8%; Score 4792; DB 2; Length 950; 

Best Local Similarity 92.9%; Pred. No. 0; 

Matches 883; Conservative 22; Mismatches 45; Indels 0; Gaps 0; 
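
The two percentages on the statistics lines can be reproduced from the other reported numbers. The sketch below assumes that "Query Match" is the hit score divided by the perfect score (5164), and that "Best Local Similarity" is the number of matches divided by the total number of alignment columns (matches + conservative + mismatches + indels). Both definitions are inferred from the figures in this report, not taken from the search program's documentation.

```python
PERFECT_SCORE = 5164  # from the report header


def query_match_pct(score, perfect=PERFECT_SCORE):
    """Assumed definition: hit score as a percentage of the perfect score."""
    return 100.0 * score / perfect


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Assumed definition: identical columns over all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# (score, matches, conservative, mismatches, indels) as printed in the report
hits = {
    "ATS15_HUMAN": (5156, 949, 0, 1, 0),
    "Q504Z2_MOUSE": (4792, 883, 22, 45, 0),
    "Q4RYP9_TETNG": (2916.5, 535, 125, 176, 111),
}

for name, (score, m, c, x, i) in hits.items():
    qm = round(query_match_pct(score), 1)
    bls = round(best_local_similarity_pct(m, c, x, i), 1)
    print(f"{name}: Query Match {qm}%, Best Local Similarity {bls}%")
```

For the three hits listed, these formulas give 99.8/99.9, 92.8/92.9 and 56.5/56.5, which agree with the printed values.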
Qy   1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       ||||||  || | | ||  ||| ||||||| ||||||| || || |||||||||||||||
Db   1 MLLLGISILALAWRPAGSSEPEWEVVVPIRRDPDINGRHYYRRGTEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       |:|||||||||||||||||:||:|||||| ||| | ||||||||| ||||||||||||||
Db  61 QQDFYLHLTPDAQFLAPAFATEYLGVPLQRLTGSSLDLRRCFYSGYVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||| ||| |||:||||||||||| | ||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNTSAPEAQRHSQGAHLLQRRGAPVGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
       ||||||||||||||| | ||| :|||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRTGAGESHNRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||| |||:|||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLGDRDTGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       |||||||||||:||||||| ||:|||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPITLPEDLPGTSYSLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||| ||||||||||| ||:|||||||||:||| |||||||||||||
Db 481 QTRHFPWADGTSCGEGKFCLKGACVERHNPNKYRVDGSWAKWEPYGSCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       |||:||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCSNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||||||||||||||||||||||||||||||||:||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCTPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
       |||||||:|||||||||||||||:||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKKFDKCGVCGGDNKSCKRVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
       |||||||||||||||||||||||||||||||||||:||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSVLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       |||||||||||||||||||||||||||||: ||||||  || ||||||||||||||:|||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSTRPKDPRGSPVLRNSVLSLSNQVEQPDNRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       |||||||||||| |||||||||||||| | ||:   |||  |||:| :||||||||||| 
Db 841 ARWVAGSWGPCSVSCGSGLQKRAVDCRDSPGQQGASACDVDHRPLEKRACGEPCPTWELG 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
        ||||||||||||:|| ||||||||||||||||:| ||||||||||||||
Db 901 NWSPCSKSCGRGFKRRPLKCVGHGGRLLARDQCDLRRKPQELDFCVLRPC 950





RESULT 4 
Q4RYP9_TETNG 

ID Q4RYP9_TETNG PRELIMINARY; PRT; 882 AA. 

AC Q4RYP9; 

DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. 

DT 19-JUL-2005, sequence version 1. 

DT 07-FEB-2006, entry version 6. 

DE Chromosome 16 SCAF14974, whole genome shotgun sequence. (Fragment). 

GN ORFNames=GSTENG00026844001;
OS Tetraodon nigroviridis (Green puffer).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC Tetradontoidea; Tetraodontidae; Tetraodon.

OX NCBI_TaxID=99883; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A.,
RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B.,
RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S.,
RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B.,
RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V.,
RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J.,
RA Parra G., Lardier G., Chappie McKernan K.J., McEwan P., Bosak S.,
RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J.,
RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M.,
RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C.,
RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals
RT the early vertebrate proto-karyotype.";

RL Nature 431:946-957(2004). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 



RG Genoscope; Whitehead Institute Centre for Genome Research; 

RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. 

CC -!- CAUTION: The sequence shown here is derived from an
CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC preliminary data.
CC -!- SIMILARITY: Contains 1 peptidase M12B domain.
CC
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR EMBL; CAAE01014974; CAG06483.1; -; Genomic_DNA.
DR GO; GO:0031012; C:extracellular matrix; IEA.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.
DR GO; GO:0006508; P:proteolysis; IEA.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR000884; TSP1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF01562; Pep_M12B_propep; 1.
DR Pfam; PF01421; Reprolysin; 1.
DR Pfam; PF00090; TSP_1; 3.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO; 1.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.
FT NON_TER 1 1
SQ SEQUENCE 882 AA; 94853 MW; C374171E3D787E4C CRC64;

Query Match 56.5%; Score 2916.5; DB 2; Length 882; 

Best Local Similarity 56.5%; Pred. No. 9e-203; 

Matches 535; Conservative 125; Mismatches 176; Indels 111; Gaps 13; 
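
The indel count on this statistics line can be checked against the coordinates printed in the alignment that follows, where the query spans residues 50 to 950 and the database sequence spans residues 1 to 882. A minimal sketch, assuming that every alignment column is counted exactly once as a match, conservative substitution, mismatch or indel; the helper name is illustrative.

```python
def gap_columns(total_columns, start, end):
    """Columns in which this sequence shows a gap rather than a residue."""
    residues = end - start + 1
    return total_columns - residues


# Values printed for Result 4 (Q4RYP9_TETNG).
matches, conservative, mismatches, indels = 535, 125, 176, 111
columns = matches + conservative + mismatches + indels

query_gaps = gap_columns(columns, 50, 950)   # gap characters in the Qy rows
db_gaps = gap_columns(columns, 1, 882)       # gap characters in the Db rows

print(columns, query_gaps, db_gaps, query_gaps + db_gaps)
```

The two gap totals add up to the reported 111 indels. The "Gaps 13" figure (presumably the number of separate gap runs) cannot be verified this way, because the OCR has lost the gap characters in several rows.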

Qy 50 DQGLIFQITAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNA 109 

I |:|:: I I II I Ih III I : H lllllllll 

Db 1 DGRLVFKLATFGRDLYLRLAPDSSFLAAA AGAHVRGCFYSGDVNA 45 

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNASAPAAQRNS--QGAHLLQRRGVPGGPS 167 

Hllllhllllll I I MM Ml II: h: II I 
Db 46 DPDSFAALSLCGGLSGGFSLDGAEYSISP GGPPGGRGDPVERTHVISRRRV 96 

Qy 168 GDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWAD 227 

: I MM l|::| II MMI hlMM Mill 

Db 97 ANSTGWCGAAPGGGPAWRG PPR RARRFSSVPRFLEALWAD 138 

Qy 228 ESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAAL 287

M II IIIMMII:! Illllllh IM::|II h: IhhIIIII 
Db 139 ASMAAAHGDDLEHYLLTLMAWARLYRHPSLRNPVSWWGVWTGEAPRGPRVSGNAAL 198 

Qy 288 TLRNFCAWQKKLNKVSDKHPEYWDTAILFTR 318 

MhlhlhlM: MMhMllhllh 

Db 199 TLRDFCSWQRKLNRQSDRHPDHWDTAMLFTQQVRGRGQGERTGVRTGGSTQAGSVSIRRL 258 
Qy 319 QDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFN 368 

MIIMIIMMIMIMhMhMMhMMIhlll IIMIIIII 

Db 259 QVRRLTQVCHQDLCGATTCDTLGMADVGTVCDPQRSCSWEDDGLPTAFTAAHELGHVFN 318

Qy 369 MPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPS 428 

MMIM I MM h MIMMhhl MIMIMhIIIII IMIIIIIM 

Db 319 MPHDNVKACRDVFGTLQ-DHMMSPTLVQVDGGRPWSACSAAILTDFLDRGHGDCLLDRPQ 377 

Qy 429 KPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVCQTRHFPWA 488 

I MM M:M I Ml MM IMIh IMIMII Mil MMMMII 

Db 378 KHLSLPAGPPGSAYNLQRQCRLAFGPASRPCPFTPPCSKLWCTGSARGQAVCQTRHFPWA 437

Qy 489 DGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTP 548 

Ml II MM M h MM I I III IIIMMIIIIIIIM II I 

Db 438 DGTGCGGGRVCYRGTCIAKNSTSRPKVNGRWGPWGPYGGCSRSCGGGVQLARRECDNPAP 497 



Qy 549 ANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYS 608

MIMI IMMIIIhl III MlllhMII III M II M llllll

Db 498 ENGGKYCHGLRLKYRSCSLRPCPD--TGKSFRKQQCEEFNGLKLNTTRLGSSVTWVPKYS 555



Qy 609 GVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKR 668 

Ihl hllllllllllllllllllllllll Mil: I:|:||:|||lllll I hh 

Db 556 GVAPEDQCKLICRANGTGYFYVLAPKWDGTPCSPDTPSLCIQGRCIKAGCDGRLNSRKK 615 

Qy 669 FDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLAL 728 

llllllllhl: lllhl I Ih IIIMI :| l|:::|||lllhh hlllj: 

Db 616 FDKCGVCGGNNQGCKKVSGRFAKPIRGYNFWTLPVGAANVDIRQRGYRGMSSDENYLAV 675 

Qy 729 KNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGK 788 

|: :| lllll :|lll lllhhl hlllll Nhhl h lllllhllll 

Db 676 KDCRGHYLLNGGYWSAAERDLLVRGGLMRYSGTSPAVETLRAVAPLQEPLTVELLSVGG 735 

Qy 789 MTPPRVRYSFYLP KEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPA-RW 843 

lllllllhll: II |: II I : : : I I II 

Db 736 MTPPRVRYTFYVSVATGKEEEEEGRSH NYILEDEGRAEAGRW 777 

Qy 844 VAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELSAWS 903 

III I II :|| lllllll h : |: II lll|: HIIIIIII : II 

Db 778 VAGGWQACSLTCGRGLQKRAVLCQDAEGR--AADCDGAHRPLSERACGEPCPRWTVGTWS 835

Qy 904 PCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950 

III lllllhll ::| I I h:| lllllll I II I 

Db 836 PCSTSCGRGFKRRQVRCAVASGGQLPRERCAGLRKPQELDLCHLRTC 882 
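
Each database entry in this report uses the two-letter line codes of the UniProt flat-file format (ID, AC, DT, DE, GN, OS, OC, OX, RN, DR, KW, FT, SQ and so on). The sketch below groups the lines of one entry by code. It is a simplified illustration, not a full flat-file parser: `group_by_line_code` is a hypothetical helper, and it assumes a single space after the code, as in this report.

```python
from collections import defaultdict


def group_by_line_code(entry_text):
    """Collect the data part of each line under its two-letter line code."""
    fields = defaultdict(list)
    for line in entry_text.splitlines():
        code, _, rest = line.partition(" ")
        # Keep only upper-case two-letter codes; this skips alignment rows
        # such as "Qy" and "Db" and any blank lines.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(rest.strip())
    return dict(fields)


sample = """ID Q4RYP9_TETNG PRELIMINARY; PRT; 882 AA.
AC Q4RYP9;
OS Tetraodon nigroviridis (Green puffer).
OX NCBI_TaxID=99883;
DR Pfam; PF01421; Reprolysin; 1.
DR SMART; SM00209; TSP1; 3.
Qy 904 PCSKSCGRGF 913"""

fields = group_by_line_code(sample)
print(fields["AC"], len(fields["DR"]))
```

Grouping by code makes it straightforward to pull out, for example, all DR cross-references of an entry without depending on the column layout, which the OCR has not preserved.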

RESULT 5 
ATS15_MOUSE

ID ATS15_MOUSE STANDARD; PRT; 562 AA.
AC P59384;
DT 28-FEB-2003, integrated into UniProtKB/Swiss-Prot.
DT 28-FEB-2003, sequence version 1.
DT 07-MAR-2006, entry version 29.
DE ADAMTS-15 (EC 3.4.24.-) (A disintegrin and metalloproteinase with
DE thrombospondin motifs 15) (ADAM-TS 15) (ADAM-TS15) (Fragment).
GN Name=Adamts15;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC Muroidea; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=FVB/N; TISSUE=Kidney;

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

CC -!- COFACTOR: Binds 1 zinc ion per subunit (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted protein; extracellular space; 

CC extracellular matrix (By similarity). 

CC -!- DOMAIN: The spacer domain and the TSP type-1 domains are important 

CC for a tight interaction with the extracellular matrix (By
CC similarity).
CC -!- PTM: The precursor is cleaved by a furin endopeptidase (By
CC similarity).

CC -!- SIMILARITY: Contains 1 disintegrin domain. 

CC -!- SIMILARITY: Contains 1 peptidase M12B domain. 

CC -!- SIMILARITY: Contains 3 TSP type-1 domains. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC043308; AAH43308.1; -; mRNA.
DR HSSP; P07996; 1LSL.
DR MEROPS; M12.025;
DR Ensembl; ENSMUSG00000033453; Mus musculus.
DR MGI; MGI:2449569; Adamts15.
DR InterPro; IPR006586; ADAM_cysteine.
DR InterPro; IPR010294; ADAM_spacer1.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR001762; Disintegrin.
DR InterPro; IPR001818; Pept_M10A_M12B.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF00090; TSP_1; 3.
DR PRINTS; PR01857; ADAMTSFAMILY.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO; PARTIAL.
DR PROSITE; PS00546; CYSTEINE_SWITCH; FALSE_NEG.
DR PROSITE; PS00427; DISINTEGRIN_1; FALSE_NEG.
DR PROSITE; PS50214; DISINTEGRIN_2; FALSE_NEG.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; PARTIAL.

KW Extracellular matrix; Glycoprotein; Hydrolase; Metalloprotease;
KW Protease; Repeat; Zinc.






FT CHAIN      <1    562    ADAMTS-15.
FT                         /FTId=PRO_0000078212.
FT DOMAIN     <1     39    Peptidase M12B.
FT DOMAIN     40    127    Disintegrin.
FT DOMAIN    128    183    TSP type-1 1.
FT DOMAIN    451    507    TSP type-1 2.
FT DOMAIN    508    561    TSP type-1 3.
FT REGION     <1     39    Metalloprotease.
FT REGION    311    450    Spacer.
FT COMPBIAS  184    312    Cys-rich.
FT CARBOHYD  203    203    N-linked (GlcNAc...) (Potential)
FT CARBOHYD  235    235    N-linked (GlcNAc...) (Potential)
FT CARBOHYD  291    291    N-linked (GlcNAc...) (Potential)
FT DISULFID  140    177    By similarity.
FT DISULFID  144    182    By similarity.
FT DISULFID  155    167    By similarity.
FT DISULFID  463    502    By similarity.
FT DISULFID  467    506    By similarity.
FT DISULFID  478    490    By similarity.
FT NON_TER     1      1
SQ SEQUENCE 562 AA; 60964 MW; 48009AD337D44E4A CRC64;



Query Match 55.8%; Score 2881; DB 1; Length 562; 

Best Local Similarity 93.3%; Pred. No. 1.9e-200; 

Matches 517; Conservative 16; Mismatches 21; Indels 0; Gaps 0; 
Qy 397 IDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGS 456 

IIMllllllllllllllllllllllllllllilhlllllll Ihllillllllllll 

Db 9 IDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPITLPEDLPGTSYSLSQQCELAFGVGS 68 



Qy 457 KPCPYMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVD 516 

IIIIIIIIIIIIMIIIIIIIIIIIIillllllllllllll lllllllllll Ihlll 

Db 69 KPCPYMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKFCLKGACVERHNPNKYRVD 128 



Qy 517 GSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASG 576 
lllllhlll llllllllllllillhllllllllllllllllllllllllllllllll 
Db 129 GSWAKWEPYGSCSRTCGGGVQLARRQCSNPTPANGGKYCEGVRVKYRSCNLEPCPSSASG 188 

Qy 577 KSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKW 636 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 189 KSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKW 248 

Qy 637 DGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGY 696 
|||||:|||lllllllllllllllllllll|:|lllllllllllll|:||||||||llll 
Db 249 DGTLCTPDSTSVCVQGKCIKAGCDGNLGSKKKFDKCGVCGGDNKSCKRVTGLFTKPMHGY 308 

Qy 697 NFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSL 756 
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIh 
Db 309 NFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSV 368 

Qy 757 LRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPR 816 
IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIh mil 
Db 369 LRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSTRPKDPR 428 

Qy 817 GPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVP 876 
I II lllllllllllllhlllllllllllllll Illlllllllllll I Ih 
Db 429 GSPVLRNSVLSLSNQVEQPDNRPPARWVAGSWGPCSVSCGSGLQKRAVDCRDSPGQQGAS 488 

Qy 877 ACDAAHRPVETQACGEPCPTWELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLH 936 
III llhl :|llllllllll lllllllllllhll lllllllllllllll|:| 
Db 489 ACDVDHRPLEKRACGEPCPTWELGNWSPCSKSCGRGFKRRPLKCVGHGGRLLARDQCDLR 548 

Qy 937 RKPQELDFCVLRPC 950 
Illlllllllllll 
Db 549 RKPQELDFCVLRPC 562 



RESULT 6 
Q4S903_TETNG 

ID Q4S903_TETNG PRELIMINARY; PRT; 875 AA. 

AC Q4S903; 

DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. 

DT 19-JUL-2005, sequence version 1. 

DT 07-FEB-2006, entry version 4. 

DE Chromosome 7 SCAF14703, whole genome shotgun sequence. 

GN ORFNames=GSTENG00022114001; 

OS Tetraodon nigroviridis (Green puffer).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A.,
RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B.,
RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S.,
RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B.,
RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V.,
RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J.,
RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S.,
RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J.,
RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M.,
RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C.,
RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals
RT the early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RG Genoscope; Whitehead Institute Centre for Genome Research;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.

CC -!- CAUTION: The sequence shown here is derived from an 



CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is 

CC preliminary data. 

CC -- 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; CAAE01014703; CAG02879.1; -; Genomic_DNA.
DR GO; GO:0031012; C:extracellular matrix; IEA.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.
DR GO; GO:0006508; P:proteolysis; IEA.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF01562; Pep_M12B_propep; 1.
DR Pfam; PF01421; Reprolysin; 1.
DR Pfam; PF00090; TSP_1; 3.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO; 1.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.

SQ SEQUENCE 875 AA; 96535 MW; EAB3A9240ED17F58 CRC64; 

Query Match 54.3%; Score 2802; DB 2; Length 875; 

Best Local Similarity 57.0%; Pred. No. 1.9e-194; 

Matches 523; Conservative 130; Mismatches 180; Indels 84; Gaps 14; 
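
The summary percentages in this report appear to follow directly from the counts printed beside them. The following is a minimal Python sketch, assuming that Query Match is the score divided by the perfect score given in the report header (5164), and that Best Local Similarity is the number of matches divided by the sum of matches, conservative substitutions, mismatches and indels. Both formulas are inferred from the printed figures, not taken from the search program's documentation, and the function names are illustrative only.

```python
PERFECT_SCORE = 5164  # "Perfect score" from the report header


def query_match(score, perfect=PERFECT_SCORE):
    """Assumed definition: score as a percentage of the perfect score."""
    return round(100.0 * score / perfect, 1)


def best_local_similarity(matches, conservative, mismatches, indels):
    """Assumed definition: identical positions over all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Result number -> (score, matches, conservative, mismatches, indels),
# copied from the summary lines of Results 5 to 8 in this report.
RESULTS = {
    5: (2881, 517, 16, 21, 0),
    6: (2802, 523, 130, 180, 84),
    7: (2505, 484, 160, 246, 100),
    8: (2482.5, 478, 158, 249, 111),
}

for number, (score, m, c, x, i) in RESULTS.items():
    print(number, query_match(score), best_local_similarity(m, c, x, i))
```

Under these assumptions the computed values agree with the percentages printed for each of the four results.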

Qy 54 IFQITAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDS 113 

: :| III:: I I I hlh : : I ::|| Mllll Ih: :| 

Db 23 VLKIRAFQQELVLDLHQDFSFIAPSVANQDKFWSPDG--DDTADLSRCFYSGYVNSDRES 80 

Qy 114 FAAVSLCGGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSR 173 

:|hlll llllllh:! II llh I h:: lh::|l : III 

Db 81 YAALSLCKGLRGAFGFQGWEYFISPVQN RSTESAHIIRRR-TNSNQKLNSTSR 132 

Qy 174 CGVASGWNPAILRALDPYKPRRAGFGESRS-RRRSGRAKRFVSIPRYVETLWADESMVK 232 

III: ::| II I : = h MUM hllMIIIIMI llh 

Db 133 CAVDSDLSLKAAKSLAKYKHLEDGSNVTETVLRKIGRAKRFASVPRYVETLWADSSMVQ 192 

Qy 233 FHGADLEHYLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNF 292 

llhlhlllllh: IIMhIIIIM III llh::: : I lllhlMhIMM 
Db 193 FHGSDLKHYLLTLMSVAARLYKHPSILNFINIAWKIVIISEEDKGPKVSGNAAMTLRNF 252 

Qy 293 CAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDG 352 

I lllllll :|IIIIIIIIIMIIIIIIII|:MIIIIIIMIIIM IIIIIMIIII 
Db 253 CTWQKKLNKNTDKHPEYWDTAILFTRQDLCGASTCDTLGMADVGTMCDSKRSCSVIEDDG 312 

Qy 353 LPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIIT 412 

MIMIMMMMMMIMM I :|lllh MIIIMIIIhl :|ll lllllll 

Db 313 LPSAFTTAHELGHVFNMPHDNVKACADVFGKLQDNHMMSPTLIQINRTSPWSPCSAAIIT 372 

Qy 413 DFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQ-YCTKLWCT 471 

:MllllhlMIII Ih Ih Mill : :MIMII Mlllh:| I :MII 
Db 373 EFLDSGHGECLLDQPQKPLVLPDVQPGASYGIDRQCELAFGEGSKPCPFIQPPCGRLWCT 432 

Qy 472 GKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRT 531 

Ih I :|l Mllllllll II ::| :| I :| I :|M I II :| MM 
Db 433 GKSNGHLVCMTRHFPWADGTQCGNDQVCEQGICSDR-QLQNAKVDGRWGKWGQFGSCSRT 491 

Qy 532 CGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYN 591 

IMIIIhlhl II hlllllhMIIIIIIIII II I I 

Db 492 CGGGVQLSRRECNNPVPSNGGKYCQGVRVKYRSCNLNHCPD KGVN 536 

Qy 592 HSTNRLTLAV--AWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVC 649 

I : h :|: h: Ml IIMM II 

Db 537 LSAEQTAPAISTSWL PRLWDGTPCSPDSTGVC 568 



Qy 650 VQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSI 709 
llllllllllll :|| hllllhllllll lllhlllllhlllllll :| 
Db 569 VQGKCIKAGCDGKIGSDKKFDKCGICGGDNKGCKKVSGLFTKPVHGYNFWMLPVGAANI 628 

Qy 710 DIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESL 769 
lllllllll: Illll|:|ll hllllh::|ll llh:|l llllllll hi 
Db 629 DIRQRGYKGMTSDDNYLAVKNSDGRYLLNGNYIVSAGERDIIVKKSLLRYSGTTGLSETL 688 

Qy 770 QASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKEPREDKS SHPKDPRGPSVLHNSV 825 
|: :|: I llllll I :: I I I I : I I I I I I =: :|||: =1 I III 
Db 689 QSVKPLEEALTVEVLCAGQITPPRIRYSFYLSRQNKEDKNLKKEAHDNSP NSV 741 

Qy 826 LS LSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTV 875 
|: I : :: I :|:: | |i :||:|||:|:| j j: 
Db 742 LAHDGAKENGAKLLKKSYNKENPAPGKWISAVWDKCSVTCGNGLQRRSVQCLKPDGK 798 

Qy 876 PA--CDAAHRPVETQACGEPCPTWELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQC 933 
II II:: :| :|: II III I : I II I I II : I I : II I I I I Ihl 
Db 799 PATDCDSSQQPSDTRVCGTPCPEWLIGQWSPCSRTCGKGFKRRPLHCKTQSGHSLPRDRC 858 

Qy 934 NLHRKPQELDFCVLRPC 950 
:|||||lll II I 
Db 859 TGLQKPQELDFCNLRSC 875 



RESULT 7 
Q2N1I7_PIG 



ID Q2N1I7_PIG PRELIMINARY; PRT; 947 AA. 

AC Q2N1I7; 

DT 07-FEB-2006, integrated into UniProtKB/TrEMBL. 

DT 07-FEB-2006, sequence version 1. 

DT 07-FEB-2006, entry version 1. 

DE ADAMTS1.
GN Name=ADAMTS1;
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae;
OC Sus.
OX NCBI_TaxID=9823;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RA Yue K., Jiang S.;
RT "Sus scrofa DNA for ADAMTS1.";
RL Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases.

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; DQ177331; ABA54553.1; -; Genomic_DNA. 

SQ SEQUENCE 947 AA; 103009 MW; 86483441F9AF06D5 CRC64; 



Query Match 48.5%; Score 2505; DB 2; Length 947; 

Best Local Similarity 48.9%; Pred. No. 8e-173; 

Matches 484; Conservative 160; Mismatches 246; Indels 100; Gaps 22; 

Qy 1 MLLLGILT--LAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQIT 58 

:MI III I : hhl i= h II:: 
Db 17 LLLLTAATTLLVLQGAHGRPVEEDEELVLP-ALERDL AHGTAHLL— LD 62 

Qy 59 AFQEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSFA 115 

II I I II III! I: : :| I : : II Mill II :M I 

Db 63 AFGSQLRLELQPDRGFLAPGFTLQTVGRRPGPNASHSDPAGDLAHCFYSGTVNRDPSSAA 122 

Qy 116 AVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRGVPGGP 166 

hill hllll :|ll II I h nil : llhll II 

Db 123 ALSLCEGVRGAFYLQGEEYFIQPAPAAAAVGLAPAAAAGEEPLARPQFHLLRRRRRGGG- 181 

Qy 167 SGDPTSRCGV ASGWNPAILRALDPYKPRRAGFGESRSRRRSG 208 

::||| h I I ::hlll : I 

Db 182 GAKCGVLDDETQLAKDAGSEGEDAAAQWPP QNWEPQRAG QSTGTGSL 228 



Qy 209 RAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIWVK 268 

I Mill lllll|::|l|:|l :|l|: hllllll : IIIIIUIII : ^^HIII 
Db 229 RKKRFVSSPRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRDSVSLWVK 288 

Qy 269 VLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCD 328 

:|:: : I I :: II II I I i II II II I : I 11= I M I I I I I I I I I I I I I II 
Db 289 ILVIYEEQKGPEITSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGACyrCD 348 

Qy 329 TLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANH 388 

IIMIIIIhlll lllllllllll :||llllllllllllll|: : I : 1 I :| 
Db 349 TLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAEQCASLNGVNRDSH 408 

Qy 389 MMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQC 448 

II: I :||: III III :|| I I I : I I I : I I : I : I II II MM I ' ' W 
Db 409 MMASMLSNLDRSQPWSPCSAYMITSFLDNGHGECLMDKPQSPIQLPSDLPGTLYDANRQC 468 

Qy 449 ELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVER 507 

: II II II II Mill : I Mllhllllllllllllll hill: 

Db 469 QFTFGEESKHCPDPASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVNGKCVNK 528 

Qy 508 HNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRS 564 

: II I III I hi IIIIIIMII hi II I IIIIIIM Ihlll 

Db 529 TD-KKHFDTPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYRS 587 

Qy 565 CNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANG 624 

Ihl M : :|hllllllll I :: :: II llhlllhhIIMhl I 

Db 588 CNIEDCPEN-NGKTFREEQCEAHNEFSKASFGSGPPVEWTPKYAGVSPKDRCKLICQAKG 646 

Qy 625 TGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKK 684 

Mhli Mlllll IIIIMIIIIIhhIIIM : MhlMlhllh :IM 

Db 647 IGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGICGGNGSTCKK 706 

Qy 685 VTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWS 744 

::| I Ih :| II lh:|:::|l :| : ^Mhl : I hill I :| 

Db 707 ISGSVTSAKPGYHDIVTIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILNGDFTLS 766 

Qy 745 AVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKEP 804 

:hh llhllMh hi ::: h lllh:|hll |:::|:::: h 

Db 767 TLEQDITYKGSVLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTYFVKKK- 825 

Qy 805 REDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRAV 864 

II h: : I 11 M II :M hhl I 

Db 826 KES FNAIPTFS EWVIEEWGECSKTCGLGVQRRLV 859 

Qy 865 DCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKCV 921 

:M II II : I :| h I : III hi lllllhlhh::hhh 
Db 860 ECRDINGQ PASECAKEVKPASTRPCADLPCPHWQLGDWSPCSKTCGKGYKKRTLQCL 916 

Qy 922 GHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

II :|l : h :|h :|ll : I 
Db 917 SHDGGVLAHESCDPLKKPKHYIDFCTMAEC 946 



RESULT 8 
ATS1_MOUSE

ID ATS1_MOUSE STANDARD; PRT; 968 AA.
AC P97857; O54768;
DT 30-MAY-2000, integrated into UniProtKB/Swiss-Prot.
DT 01-DEC-2000, sequence version 3.
DT 07-MAR-2006, entry version 64.
DE ADAMTS-1 precursor (EC 3.4.24.-) (A disintegrin and metalloproteinase
DE with thrombospondin motifs 1) (ADAM-TS 1) (ADAM-TS1).
GN Name=Adamts1;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC Muroidea; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=129/SvJ;
RX MEDLINE=98110583; PubMed=9441751; DOI=10.1006/geno.1997.5064;
RA Kuno K., Iizasa H., Ohno S., Matsushima K.;
RT "The exon/intron organization and chromosomal mapping of the mouse
RT ADAMTS-1 gene encoding an ADAM family protein with TSP motifs.";
RL Genomics 46:466-471(1997).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RX MEDLINE=97150761; PubMed=8995297; DOI=10.1074/jbc.272.1.556;
RA Kuno K., Kanada N., Nakashima E., Fujiki F., Ichimura F.,
RA Matsushima K.;
RT "Molecular cloning of a gene encoding a new type of metalloproteinase-
RT disintegrin family protein with thrombospondin motifs as an
RT inflammation associated gene.";
RL J. Biol. Chem. 272:556-562(1997).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Limb, and Mammary gland;

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [4]
RP CHARACTERIZATION, AND MUTAGENESIS OF GLU-403.
RX MEDLINE=99303657; PubMed=10373500; DOI=10.1074/jbc.274.26.18821;
RA Kuno K., Terashima Y., Matsushima K.;
RT "ADAMTS-1 is an active metalloproteinase associated with the
RT extracellular matrix.";
RL J. Biol. Chem. 274:18821-18826(1999).
RN [5]
RP FUNCTION.
RX MEDLINE=20389568; PubMed=10930576; DOI=10.1016/S0014-5793(00)01854-8;
RA Kuno K., Okada Y., Kawashima H., Nakamura H., Miyasaka M., Ohno H.,
RA Matsushima K.;
RT "ADAMTS-1 cleaves a cartilage proteoglycan, aggrecan.";
RL FEBS Lett. 478:241-245(2000).
RN [6]
RP FUNCTION, AND INDUCTION.
RX MEDLINE=20243757; PubMed=10781075; DOI=10.1073/pnas.080073497;
RA Robker R.L., Russell D.L., Espey L.L., Lydon J.P., O'Malley B.W.,
RA Richards J.S.;
RT "Progesterone-regulated genes in the ovulation process: ADAMTS-1 and
RT cathepsin L proteases.";
RL Proc. Natl. Acad. Sci. U.S.A. 97:4689-4694(2000).

CC -!- FUNCTION: Cleaves aggrecan, a cartilage proteoglycan, and may be
CC involved in its turnover. Has angiogenic inhibitor activity (By
CC similarity). Active metalloprotease, which may be associated with
CC various inflammatory processes as well as development of cancer
CC cachexia. May play a critical role in follicular rupture (By
CC similarity).
CC -!- CATALYTIC ACTIVITY: Cleaves aggrecan at the 1691-Glu-|-Leu-1692
CC site, within the chondroitin sulfate attachment domain.

CC -!- COFACTOR: Binds 1 zinc ion per subunit (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted protein; extracellular space; 

CC extracellular matrix. 

CC -!- INDUCTION: Induced in vitro in colon adenocarcinoma cells by 
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interleukin-1, or in vivo in kidney and heart by 
CC lipopolysaccharide. Also induced by LH stimulation in granulosa
CC cells of preovulatory follicles.
CC -!- DOMAIN: The spacer domain and the TSP type-1 domains are important
CC for a tight interaction with the extracellular matrix.
CC -!- PTM: The precursor is cleaved by a furin endopeptidase.
CC -!- SIMILARITY: Contains 1 disintegrin domain.
CC -!- SIMILARITY: Contains 1 peptidase M12B domain.
CC -!- SIMILARITY: Contains 3 TSP type-1 domains.
CC -!- CAUTION: Ref.2 sequence differs from that shown due to a
CC frameshift in position 7.

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License

DR EMBL; AB001735; BAA24501.1; ALT_INIT; Genomic_DNA.
DR EMBL; D67076; BAA11088.1; ALT_FRAME; mRNA.
DR EMBL; BC040382; AAH40382.1; -; mRNA.
DR EMBL; BC050834; AAH50834.1; -; mRNA.
DR HSSP; P07996; 1LSL.
DR MEROPS; M12.222; -.
DR Ensembl; ENSMUSG00000022893; Mus musculus.
DR MGI; MGI:109249; Adamts1.
DR InterPro; IPR006586; ADAM_cysteine.
DR InterPro; IPR010294; ADAM_spacer1.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR013274; ADAM_TS1.
DR InterPro; IPR001762; Disintegrin.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR002870; Peptidase_M12B_N.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF01562; Pep_M12B_propep; 1.
DR Pfam; PF01421; Reprolysin; 1.
DR Pfam; PF00090; TSP_1; 3.
DR PRINTS; PR01858; ADAMTS1.
DR PRINTS; PR01857; ADAMTSFAMILY.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00608; ACR; 1.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO; 1.
DR PROSITE; PS00427; DISINTEGRIN_1; FALSE_NEG.
DR PROSITE; PS50214; DISINTEGRIN_2; FALSE_NEG.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; 1.
KW Extracellular matrix; Glycoprotein; Heparin-binding; Hydrolase;
KW Metal-binding; Metalloprotease; Protease; Repeat; Signal; Zinc;
KW Zymogen.

Potential.

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/FTId=PRO_0000029152 . 
ADAMTS-1. 

/FTId=PRO_0000029153 . 

Peptidase M12B. 

Disintegrin. 

TSP type-1 1. 

TSP type-1 2. 

TSP type-1 3. 

Spacer . 

Poly-Arg. 

Cys-rich. 



Zinc (catalytic) 
Zinc (catalytic) 
Zinc (catalytic) 



(By similarity) . 
(By similarity) . 
(By similarity) . 



Cysteine switch (Potential) . 
N-linked (GlcNAc. . .) (Potential) 
N-linked (GlcNAc. . .) (Potential) 
N-linked (GlcNAc. . ,) (Potential) 



FT CARBOHYD  783    783    N-linked (GlcNAc...) (Potential)
FT CARBOHYD  946    946    N-linked (GlcNAc...) (Potential)
FT DISULFID  380    463    By similarity.
FT DISULFID  418    447    By similarity.
FT DISULFID  572    609    By similarity.
FT DISULFID  576    614    By similarity.
FT DISULFID  587    599    By similarity.
FT MUTAGEN   403    403    E->Q: Loss of activity.
FT CONFLICT  335    335    N -> S (in Ref. 2).
FT CONFLICT  425    425    T -> S (in Ref. 2).
SQ SEQUENCE 968 AA; 105842 MW; 42EBDA55499FB6C1 CRC64;

Query Match 48.1%; Score 2482.5; DB 1; Length 968;



Best Local Similarity 48.0%; Pred. No. 3.5e-171; 
Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGG— FEPEREWVPIRLDPDINGRRYYWRGP-EDSGDQGLIFQI 57 

:||| :|: I I I I : hhl h I I II I :: 

Db 37 LLLLASITMLLCARGAHGRPTEEDEELVLP-SLE RAPGHDSTTTRL--RL 83 

Qy 58 TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNA 109 

II : :|| II: IMI h : : I : II II Hill II 

Db 84 DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 138 

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161 

:| I IhMI hllll :| h I I I : III I hhll 

Db 139 DPGSAAALSLCEGVRGAFYLQGEEFFIQPAPGVATERLAPAVPEEESSARPQFHILRRR- 197 

Qy 162 VPGGPSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206 

I ::||l I I II:: I 

Db 198 RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 243 

Qy 207 SGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

I I lllll lllll|::|l|:M llh hllllll : III hllll I I 
Db 244 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 303 

Qy 263 NIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||||:|:: : Ihll llllllllll llh I Ih I h : I I I I I I I I I I I I 
Db 304 SLWVKILVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 363 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382 

h lllllllllllhlll IMIIIIIIII :MIIIIIIIIMIIIIh M : I 
Db 364 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 423 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

:hh I :| : III III ::| llhllhlhhl II II llll I 
Db 424 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 483 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

::|h II M il II lllll : I = I II h I I I I I I II I I I I I I h 

Db 484 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 543 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

I II : II I III I hi llllllllll hi II I llllllll 

Db 544 GKCVNKTDM-KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 602 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

Ihllllhl II : :|hllllllll I M llhlllhhill 

Db 603 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 661 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I llhll lllllll lllllllllllhhillll : MhlMIIIMh 

Db 662 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 721 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:|lh:h I Ih :| llllh:h:: I =1 ^Hh: : 1 hll 

Db 722 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 781 



Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

hi :| :hll lh:|lllh hi h lllh:|l II I-::- 

Db 782 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKFTY 841 



Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: |: |:: : I II II II Ulll 

Db 842 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 874 

Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

|:| III I II : I :| |: I : III lllllhlh|::: 

Db 875 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 931 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

hllll I I :|: : h :|h HII I I 
Db 932 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 967 



RESULT 9 
Q3U0I4_MOUSE

ID Q3U0I4_MOUSE PRELIMINARY; PRT; 968 AA.
AC Q3U0I4;
DT 11-OCT-2005, integrated into UniProtKB/TrEMBL.
DT 11-OCT-2005, sequence version 1.
DT 07-FEB-2006, entry version 5.
DE Activated spleen cDNA, RIKEN full-length enriched library,
DE clone:F830102H03 product:a disintegrin-like and metalloprotease
DE (reprolysin type) with thrombospondin type 1 motif, 1, full insert
DE sequence.
GN Name=Adamts1;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC Muroidea; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=NOD; TISSUE=Activated spleen;
RX MEDLINE=99279253; PubMed=10349636; DOI=10.1016/S0076-6879(99)03004-9;
RA Carninci P., Hayashizaki Y.;
RT "High-efficiency full-length cDNA cloning.";
RL Methods Enzymol. 303:19-44(1999).

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen; 

RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M.,
RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E.,
RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L.,
RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M.,
RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R.,
RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G.,
RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G.,
RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M.,
RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N.,
RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T.,
RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H.,
RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K.,
RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J.,
RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L.,
RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K.,
RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P.,
RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O.,
RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G.,
RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M.,
RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schonbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B.,
RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K.,
RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A.,
RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K.,
RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C.,
RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J.,
RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y.,
RA Fukuda S., Kanamori-Katayama M., Suzuki N., Aoki J., Arakawa T.,
RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N.,
RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N.,
RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S.,
RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J.,
RA Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).

RN [3]

RP NUCLEOTIDE SEQUENCE.

RC STRAIN=NOD; TISSUE=Activated spleen;

RX PubMed=16141073; DOI=10.1126/science.1112009;

RG RIKEN Genome Exploration Research Group, and Genome Science Group 

RG (Genome Network Core Team) and the FANTOM Consortium; 

RT "Antisense Transcription in the Mammalian Transcriptome.";

RL Science 309:1564-1566(2005). 

RN [4]

RP NUCLEOTIDE SEQUENCE.

RC STRAIN=NOD; TISSUE=Activated spleen;

RX MEDLINE=22354683; PubMed=12466851; DOI=10.1038/nature01266;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs."; 

RL Nature 420:563-573(2002). 

RN [5] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen;

RX MEDLINE=21085660; PubMed=11217851; DOI=10.1038/35055500;

RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S.,

RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,

RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,

RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J.,

RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,

RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,

RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,

RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P.,

RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N.,

RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,



RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S.,

RA Hayashizaki Y.; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

RN [6] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen; 

RX MEDLINE=20499374; PubMed=11042159; DOI=10.1101/gr.145100;

RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,

RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;

RT "Normalization and subtraction of cap-trapper-selected cDNAs to

RT prepare full-length cDNA libraries for rapid discovery of new genes."; 

RL Genome Res. 10:1617-1630(2000). 

RN [7] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen;

RX MEDLINE=20530913; PubMed=11076861; DOI=10.1101/gr.152600;

RA Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,

RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M.,

RA Sumi N., Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A.,

RA Yamamoto R., Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K.,

RA Fujiwake S., Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M.,

RA Yoneda Y., Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J.,

RA Okazaki Y., Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;

RT "RIKEN integrated sequence analysis (RISA) system-384-format

RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

RN [8]

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=NOD; TISSUE=Activated spleen; 

RA Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K.,

RA Hori F., Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S.,

RA Kawai J., Kojima M., Konno H., Murata M., Nakamura M., Ninomiya N.,

RA Nishiyori H., Nomura K., Ohno M., Sakazume N., Sano H., Sasaki D.,

RA Shibata K., Shiraki T., Tagami M., Tagami Y., Waki K., Watahiki A.,

RA Muramatsu M., Hayashizaki Y.;

RL Submitted (MAR-2004) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AK156830; BAE33869.1; -; mRNA. 

DR MGI; MGI:109249; Adamts1.

DR GO; GO:0031410; C:cytoplasmic vesicle; IDA.

DR GO; GO:0031012; C:extracellular matrix; IDA.

DR GO; GO:0008201; F:heparin binding; IDA.

DR GO; GO:0001822; P:kidney development; IMP.

DR GO; GO:0001542; P:ovulation (sensu Mammalia); IMP.

DR InterPro; IPR006586; ADAM_cysteine.

DR InterPro; IPR010294; ADAM_spacer1.

DR InterPro; IPR013273; ADAM_TS.

DR InterPro; IPR013274; ADAM_TS1.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR001590; Peptidase_M12B.

DR InterPro; IPR002870; Peptidase_M12B_N.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR Pfam; PF05986; ADAM_spacer1; 1.

DR Pfam; PF01562; Pep_M12B_propep; 1.

DR Pfam; PF01421; Reprolysin; 1.

DR Pfam; PF00090; TSP_1; 3.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00608; ACR; 1.

DR SMART; SM00209; TSP1; 3.

DR PROSITE; PS50215; ADAM_MEPRO; 1.

DR PROSITE; PS50092; TSP1; 3.

DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.

KW Integrin; Metalloprotease; Protease.

SQ SEQUENCE 968 AA; 105826 MW; 7FB0B6554984DACA CRC64; 



Query Match 48.1%; Score 2482.5; DB 2; Length 968;



Best Local Similarity 48.0%; Pred. No. 3.5e-171; 

Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 



Qy 1 MLLLGILTLAFAGRTAGG--FEPEREWVPIRLDPDINGRRYYWRGP-EDSGDQGLIFQI 57

:||| :h III I : hhl h I I II I 

Db 37 LLLLASITMLLCARGAHGRPTEEDEELVLP-SLE RAPGHDSTTTRL--RL 83 

Qy 58 TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNA 109 

II : :| I Ih MM |: : : I : II II Mill II 

Db 84 DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 138 

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161 

:| I Ihlll hllll :| |: I I I : III I hhll 

Db 139 DPGSAAALSLCEGVRGAFYLQGEEFFIQPAPGVATERLAPAVPEEESSARPQFHILRRR- 197 

Qy 162 VPGGPSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206 

I ::|ll I I Ih: I 

Db 198 RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 243 

Qy 207 SGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

I I Hill lllll|::|||:|| llh hllllll = III hllll I I 
Db 244 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 303 

Qy 263 NIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||lhh: Ihll llllllllll llh I Ih I h : I I I I I I I I I I I I 

Db 304 SLVWKILVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 363 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382 

h lllllllllllhlll lllllllllll Hlllllllllllllllh I I : I 
Db 364 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 423 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

:hh I :| : III III ::| llhlihihhl II II Mil I 
Db 424 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 483 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

::|h II II II II lllll : I : I I I h I I i I I I I I I I I I I I h 

Db 484 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 543 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

I II : :: II I III I hi llllllllll hi II I llllllll 

Db 544 GKCVNKTDM-KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 602 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

Ihllllhl II : :|hllllllil I II llhlllhhill 

Db 603 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 661 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I llhll lllllll IIIIMIIIIIhhIIIII : llhllllllllh 
Db 662 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 721 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:|||::|: I Ih =1 llllh^::: I :| = ::|h: : I hll 

Db 722 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 781 

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798

hi :| :|:|| IhUlllh hi : = = h IllhUI II h::::: 
Db 782 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHAPRPKIKFTY 841

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: h h: M II II II Ulll 

Db 842 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 874 

Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

hi III I II : I U h I : III h: lllllhlhh:: 

Db 875 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 931

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

hllll I I :h : h :|h Hll I I 
Db 932 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 967 



RESULT 10 
ATS1_HUMAN

ID ATS1_HUMAN STANDARD; PRT; 967 AA.

AC Q9UHI8; Q9NSJ8; Q9P2K0; Q9UH83; Q9UP80;

DT 30-MAY-2000, integrated into UniProtKB/Swiss-Prot.

DT 01-DEC-2000, sequence version 3.

DT 07-MAR-2006, entry version 60.

DE ADAMTS-1 precursor (EC 3.4.24.-) (A disintegrin and metalloproteinase

DE with thrombospondin motifs 1) (ADAM-TS 1) (ADAM-TS1) (METH-1).

GN Name=ADAMTS1; Synonyms=KIAA1346, METH1;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;

OC Homo.

OX NCBI_TaxID=9606;

RN [1]

RP NUCLEOTIDE SEQUENCE [MRNA].

RA Casas C., Pritchard M.A., Estivill X., Arbones M.L.;

RT "Cloning, characterization and mapping on human chromosome 21 of the

RT orthologue of murine Adamts-1.";

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP NUCLEOTIDE SEQUENCE [MRNA], AND FUNCTION.

RC TISSUE=Heart;

RX MEDLINE=99367466; PubMed=10438512; DOI=10.1074/jbc.274.33.23349;

RA Vazquez F., Hastings G., Ortega M.-A., Lane T.F., Oikemus S.,

RA Lombardo M., Iruela-Arispe M.L.;

RT "METH-1, a human ortholog of ADAMTS-1, and METH-2 are members of a new

RT family of proteins with angio-inhibitory activity.";

RL J. Biol. Chem. 274:23349-23357(1999). 

RN [3]

RP NUCLEOTIDE SEQUENCE [MRNA].

RC TISSUE=Endothelial cell;

RX MEDLINE=20247184; PubMed=10785405;

RA Glienke J., Schmitt A.O., Pilarsky C., Hinzmann B., Weiss B.,

RA Rosenthal A., Thierauch K.H.; 

RT "Differential gene expression by endothelial cells in distinct 

RT angiogenic states."; 

RL Eur. J. Biochem. 267:2820-2830(2000). 

RN [4] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].

RC TISSUE=Brain;

RX MEDLINE=20181126; PubMed=10718198; DOI=10.1093/dnares/7.1.65;

RA Nagase T., Kikuno R., Ishikawa K., Hirosawa M., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XVI.

RT The complete sequences of 150 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 7:65-73(2000). 

RN [5] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

RX MEDLINE=20289799; PubMed=10830953; DOI=10.1038/35012518;

RA Hattori M., Fujiyama A., Taylor T.D., Watanabe H., Yada T.,

RA Park H.-S., Toyoda A., Ishii K., Totoki Y., Choi D.-K., Groner Y.,

RA Soeda E., Ohki M., Takagi T., Sakaki Y., Taudien S., Blechschmidt K.,

RA Polley A., Menzel U., Delabar J., Kumpf K., Lehmann R., Patterson D.,

RA Reichwald K., Rump A., Schillhabel M., Schudy A., Zimmermann W.,

RA Rosenthal A., Kudoh J., Shibuya K., Kawasaki K., Asakawa S.,

RA Shintani A., Sasaki T., Nagamine K., Mitsuyama S., Antonarakis S.E.,

RA Minoshima S., Shimizu N., Nordsiek G., Hornischer K., Brandt P.,

RA Scharfe M., Schoen O., Desario A., Reichelt J., Kauer G., Bloecker H.,

RA Ramser J., Beck A., Klages S., Hennig S., Riesselmann L., Dagand E.,

RA Wehrmeyer S., Borzym K., Gardiner K., Nizetic D., Francis F.,

RA Lehrach H., Reinhardt R., Yaspo M.-L.;

RT "The DNA sequence of human chromosome 21."; 

RL Nature 405:311-319(2000). 

RN [6] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 418-967. 

RC TISSUE=Melanoma; 

RG The German cDNA consortium; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. 



CC -!- FUNCTION: Cleaves aggrecan, a cartilage proteoglycan, and may be

CC involved in its turnover (By similarity). Has angiogenic inhibitor

CC activity. Active metalloprotease, which may be associated with

CC various inflammatory processes as well as development of cancer

CC cachexia. May play a critical role in follicular rupture.

CC -!- CATALYTIC ACTIVITY: Cleaves aggrecan at the 1938-Glu-|-Leu-1939

CC site, within the chondroitin sulfate attachment domain.

CC -!- COFACTOR: Binds 1 zinc ion per subunit (By similarity).

CC -!- SUBCELLULAR LOCATION: Secreted protein; extracellular space;

CC extracellular matrix (By similarity).

CC -!- DOMAIN: The spacer domain and the TSP type-1 domains are important

CC for a tight interaction with the extracellular matrix.

CC -!- PTM: The precursor is cleaved by a furin endopeptidase (By

CC similarity).

CC -!- SIMILARITY: Contains 1 disintegrin domain. 

CC -!- SIMILARITY: Contains 1 peptidase M12B domain. 

CC -!- SIMILARITY: Contains 3 TSP type-1 domains. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AF170084; AAF15317.1; -; mRNA. 

DR EMBL; AF060152; AAD48080.1; ALT_INIT; mRNA. 

DR EMBL; AF207664; AAF23772.1; -; mRNA. 

DR EMBL; AB037767; BAA92584.1; ALT_INIT; mRNA.

DR EMBL; AP001697; BAA95502.1; -; Genomic_DNA.

DR EMBL; AL162080; CAB82413.1; -; mRNA.

DR PIR; T47158; T47158.

DR HSSP; P07996; 1LSL.

DR MEROPS; M12.222;

DR Ensembl; ENSG00000154734; Homo sapiens.

DR H-InvDB; HIX0016042; -.

DR HGNC; HGNC:217; ADAMTS1.

DR MIM; 605174; gene.

DR GO; GO:0008237; F:metallopeptidase activity; TAS.

DR GO; GO:0007229; P:integrin-mediated signaling pathway; TAS.

DR GO; GO:0008285; P:negative regulation of cell proliferation; TAS.

DR InterPro; IPR006586; ADAM_cysteine.

DR InterPro; IPR010294; ADAM_spacer1.

DR InterPro; IPR013273; ADAM_TS.

DR InterPro; IPR013274; ADAM_TS1.

DR InterPro; IPR001762; Disintegrin.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR001590; Peptidase_M12B.

DR InterPro; IPR002870; Peptidase_M12B_N.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR Pfam; PF05986; ADAM_spacer1; 1.

DR Pfam; PF01562; Pep_M12B_propep; 1.

DR Pfam; PF01421; Reprolysin; 1.

DR Pfam; PF00090; TSP_1; 3.

DR PRINTS; PR01858; ADAMTS1.

DR PRINTS; PR01857; ADAMTSFAMILY.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00608; ACR; 1.

DR SMART; SM00209; TSP1; 3.

DR PROSITE; PS50215; ADAM_MEPRO; 1.

DR PROSITE; PS00427; DISINTEGRIN_1; FALSE_NEG.

DR PROSITE; PS50214; DISINTEGRIN_2; FALSE_NEG.

DR PROSITE; PS50092; TSP1; 3.

DR PROSITE; PS00142; ZINC_PROTEASE; 1.

KW Extracellular matrix; Glycoprotein; Heparin-binding; Hydrolase;

KW Metal-binding; Metalloprotease; Protease; Repeat; Signal; Zinc;

KW Zymogen.

FT SIGNAL 1 49 Potential. 

FT PROPEP 50 252 By similarity. 

FT /FTId=PRO_0000029150. 

FT CHAIN 253 967 ADAMTS-1. 

FT /FTId=PRO_0000029151.

FT DOMAIN 258 467 Peptidase M12B. 

FT DOMAIN 476 559 Disintegrin.

FT DOMAIN 559 614 TSP type-1 1.

FT DOMAIN 854 905 TSP type-1 2.

FT DOMAIN 908 967 TSP type-1 3.

FT REGION 725 849 Spacer.

FT COMPBIAS 617 724 Cys-rich.

FT COMPBIAS 843 846 Poly-Lys.

FT ACT_SITE 402 402 By similarity.

FT METAL 401 401 Zinc (catalytic) (By similarity).

FT METAL 405 405 Zinc (catalytic) (By similarity).

FT METAL 411 411 Zinc (catalytic) (By similarity).

FT SITE 198 198 Cysteine switch (Potential).

FT CARBOHYD 547 547 N-linked (GlcNAc...) (Potential).

FT CARBOHYD 720 720 N-linked (GlcNAc...) (Potential).

FT CARBOHYD 764 764 N-linked (GlcNAc...) (Potential).

FT DISULFID 379 462 By similarity.

FT DISULFID 417 446 By similarity.

FT DISULFID 571 608 By similarity.

FT DISULFID 575 613 By similarity.

FT DISULFID 586 598 By similarity.

FT CONFLICT 227 227 P -> A (in Ref. 4 and 5).

FT CONFLICT 468 468 Q -> H (in Ref. 1).

FT CONFLICT 561 561 S -> N (in Ref. 1).

SQ SEQUENCE 967 AA; 105384 MW; C189389324741ED1 CRC64;



Query Match 48.0%; Score 2480.5; DB 1; Length 967; 

Best Local Similarity 48.9%; Pred. No. 5e-171; 

Matches 485; Conservative 154; Mismatches 251; Indels 101; Gaps 24; 

Qy 1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60 

:||| II : I : hill h 111 II 
Db 36 LLLLAAALLAVSDALGRPSEEDEELWP-ELE RAP GHGTTRLRLHAF 81 



Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSF 114 

: I I II: MM |: :::| II Ml Mill II M I 

Db 82 DQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLP ETDLAHCFYSGTVNGDPSSA 136 

Qy 115 AAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQ--RRGVPGG 165 

MMII IMIII I I I III II I II I llh IM M 

Db 137 AALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRRNRQGDVGG 196 



Qy 166 PSG DPTSRC GVASG---WNPAILRALDPYKPRRAGFGESRSRRRS 207 

I 11: III: I I I |: : I 

Db 197 TCGWDDEPRPTGKAETEDEDEGTEGEDEGPQWS PQDPALQGVGQP-TGTGS 247 



Qy 208 GRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVW 267 

I Mill Mll|::|l|:|l Mlh hlllMI : IMIhllM I :::|ll 
Db 248 IRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVW 307 



Qy 268 KVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTC 327 

|:|:: I IhM IIIMMMI MM I M: |::|IIIIIMIIII|: M 
Db 308 KILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTC 367 



Qy 328 DTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRAN 387 

MIIMMIhIM MIIMMIM :|IMMMMIIIIII|: 11:1=: 
Db 368 DTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNGVNQDS 427 



Qy 388 HMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQ 447 

III: I :| : III III :|l MhllMIMhl II II MM II ::| 
Db 428 HMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSYDANRQ 487 



Qy 448 CELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVE 506 

|: II II II |: Mill : I :|llhlllllllllMIII h I II 

Db 488 CQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCINGKCVN 547 

Qy 507 RHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYR 563 

: : II III I hi IIIIIMIII hi II I llllllll Ihll 

Db 548 KTD-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYR 606 



Qy 564 SCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRAN 623 

Mill M : :||:|lilllll I II h I I h I M h h M I II : I 

Db 607 SCNLEDCPDN-NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKLICQAK 665 



Qy 624 GTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCK 683 

I lllllll lllllllllllhhlllll : llhllllllllh :|l 

Db 666 GIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCK 725 



Qy 684 KVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFW 743 

|::| I II: :: II ||::h::|| :| : ::||:| : I hill : : 

Db 726 KISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRWNGSFLAIKAADGTYILNGDYTL 785 

Qy 744 SAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKE 803 

I ;|:|:: II :|||||: |:|::: h lllhHhIl |:::|:::: |: 

Db 786 STLEQDIMYKGWLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTYFVKKK 845 

Qy 804 PREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRA 863 

II |:: : I II II II II I hi 

Db 846 KES FNAIPTFS AWVIEEWGECSKSCELGWQRRL 878 

Qy 864 VDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKC 920 

hll II II : I :| h I: III hi M I I h I h h : = I I I I I 
Db 879 VECRDINGQ PASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKKRSLKC 935 

Qy 921 VGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

: I I :h : h Oh :|ll = I 
Db 936 LSHDGGVLSHESCDPLKKPKHFIDFCTMAEC 966 

RESULT 11 
Q5HYL0_HUMAN 

ID Q5HYL0_HUMAN PRELIMINARY; PRT; 967 AA. 

AC Q5HYL0; 

DT 15-FEB-2005, integrated into UniProtKB/TrEMBL. 

DT 15-FEB-2005, sequence version 1. 

DT 07-FEB-2006, entry version 4. 

DE Hypothetical protein DKFZp686E01144.

GN Name=DKFZp686E01144;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;

OC Homo.

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Colon endothel; 

RG The German cDNA Consortium; 

RA Wambutt R., Heubner D., Mewes H.W., Weil B., Amid C., Osanger A.,

RA Fobo G., Han M., Wiemann S.;

RL Submitted (JAN-2005) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BX647388; CAI46043.1; -; mRNA.

DR Ensembl; ENSG00000154734; Homo sapiens.

DR GO; GO:0031012; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0006508; P:proteolysis; IEA.

DR InterPro; IPR006586; ADAM_cysteine.

DR InterPro; IPR010294; ADAM_spacer1.

DR InterPro; IPR013273; ADAM_TS.

DR InterPro; IPR013274; ADAM_TS1.

DR InterPro; IPR001128; Cytochrome_P450.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR001590; Peptidase_M12B.

DR InterPro; IPR002870; Peptidase_M12B_N.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR Pfam; PF05986; ADAM_spacer1; 1.

DR Pfam; PF01562; Pep_M12B_propep; 1.

DR Pfam; PF01421; Reprolysin; 1.

DR Pfam; PF00090; TSP_1; 3.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00608; ACR; 1.

DR SMART; SM00209; TSP1; 3.

DR PROSITE; PS50215; ADAM_MEPRO; 1.

DR PROSITE; PS00086; CYTOCHROME_P450; UNKNOWN_1.

DR PROSITE; PS50092; TSP1; 3.

DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.

KW Hypothetical protein.

SQ SEQUENCE 967 AA; 105354 MW; 24685FFB095A9497 CRC64;

Query Match 48.0%; Score 2480.5; DB 2; Length 967; 

Best Local Similarity 48.9%; Pred. No. 5e-171; 

Matches 485; Conservative 154; Mismatches 251; Indels 101; Gaps 24; 



Qy 1 MLLLGILTLAFAGRTAGGFEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60 

:|l| II : I : hill h ill II 
Db 36 LLLLAAALLAVSDALGRPSEEDEELWP-ELE RAP GHGTARLRLHAF 81 



Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSF 114 

: I I II: MM I: :::| II Ml Mill II M I 

Db 82 DQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLP ETDLAHCFYSGTVNGDPSSA 136 

Qy 115 AAVSLCGGLRGAFGYRGAEYVISPLPNAS---APAAQRNSQGA HLLQ--RRGVPGG 165 

IIMII IMIII I I I III II III I III: hi II 

Db 137 AALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRRNRQGDVGG 196 



Qy 166 PSG DPTSRC GVASG--WNPAILRALDPYKPRRAGFGESRSRRRS 207 

I II : III: I I I h : I 

Db 197 TCGWDDEPRPTGKAETEDEDEGTEGEDEGPQWS PQDPALQGVGQP-TGTGS 247 



Qy 208 GRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVW 267 

I Mill IIMh:|lhll :|lh hllllM : llllhllli I :::MI 
Db 248 IRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVW 307 

Qy 268 KVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTC 327 

hh: I Ihll llllllllll III: I Ih |::|IIIMIIIIIIh II 
Db 308 KILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTC 367 

Qy 328 DTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRAN 387 

MMMMIhlM lllllllllll :|lllllllllllllll|: I I : I : : 
Db 368 DTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNGVNQDS 427 



Qy 388 HMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQ 447 

Mh I :| : Ml III :M llhllhlhhl II II MM II \ 
Db 428 HMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSYDANRQ 487 

Qy 448 CELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVE 506 

h II II II h lllll : I :|llhllllllllllllll h I II 

Db 488 CQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCINGKCVN 547

Qy 507 RHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYR 563 

: : II III I hi IIIIMIIM hi II I IIIIIMI Ihll 

Db 548 KTD-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYR 606 



Qy 564 SCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRAN 623 

Mill II : :|hlllMIII I :: := II h I I h II I h h II I I M I 

Db 607 SCNLEDCPDN-NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKLICQAK 665 

Qy 624 GTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCK 683

I MhM Mlllll lllllllllllhhillll : MhlMMMIh :|l 

Db 666 GIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCK 725 

Qy 684 KVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFW 743 

\::\ I Ih :: II lh:h:MI :| : ::|hl : I IMM = : 

Db 726 KISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILNGDYTL 785 

Qy 744 SAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKE 803 

I :hh: M :|lll|: hi h lllh:|hll h::h::: h 

Db 786 STLEQDIMYKGWLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTYFVKKK 845 

Qy 804 PREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRA 863 

II h: : I II II II II I hi 






Db 846 KES FNAIPTFS AWVIEEWGECSKSCELGWQRRL 878 

Qy 864 VDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKC 920 

|:|l II II : I :| h I : III hi II I I I : II : I : : = I I I I I 
Db 879 VECRDINGQ PASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKKRSLKC 935 

Qy 921 VGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

: I I :|: : |: :||: :||| : | 
Db 936 LSHDGGVLSHESCDPLKKPKHFIDFCTMAEC 966 



RESULT 12 
Q5U261_XENLA 

ID Q5U261_XENLA PRELIMINARY; PRT; 928 AA. 

AC Q5U261; 

DT 07-DEC-2004, integrated into UniProtKB/TrEMBL. 

DT 07-DEC-2004, sequence version 1. 

DT 07-FEB-2006, entry version 10. 

DE LOC495679 protein. 

GN Name=LOC495679; 

OS Xenopus laevis (African clawed frog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae;

OC Xenopodinae; Xenopus; Xenopus.

OX NCBI_TaxID=8355;

RN [1]

RP NUCLEOTIDE SEQUENCE.

RC TISSUE=Embryo;

RX MEDLINE=22341132; PubMed=12454917; DOI=10.1002/dvdy.10174;

RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S.W.,

RA Richardson P.;

RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus

RT initiative.";

RL Dev. Dyn. 225:384-391(2002).

RN [2]

RP NUCLEOTIDE SEQUENCE.

RC TISSUE=Embryo;

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [3]

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Embryo; 

RA Klein S., Gerhard D.S.; 

RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC086266; AAH86266.1; -; mRNA. 

DR GO; GO:0031012; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0006508; P:proteolysis; IEA.

DR InterPro; IPR006586; ADAM_cysteine.

DR InterPro; IPR010294; ADAM_spacer1.

DR InterPro; IPR013273; ADAM_TS.

DR InterPro; IPR013274; ADAM_TS1.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR001590; Peptidase_M12B.

DR InterPro; IPR002870; Peptidase_M12B_N.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR Pfam; PF05986; ADAM_spacer1; 1.

DR Pfam; PF01562; Pep_M12B_propep; 1.

DR Pfam; PF01421; Reprolysin; 1.

DR Pfam; PF00090; TSP_1; 3.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00209; TSP1; 3.

DR PROSITE; PS50215; ADAM_MEPRO; 1.

DR PROSITE; PS50092; TSP1; 3.

DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.

SQ SEQUENCE 928 AA; 102210 MW; 6293D299A9CDC4FD CRC64; 

Query Match 47.9%; Score 2472.5; DB 2; Length 928; 

Best Local Similarity 48.5%; Pred. No. 1.8e-170; 

Matches 468; Conservative 169; Mismatches 268; Indels 59; Gaps 17; 

Qy 3 LLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAFQE 62 

II : III I hi hl|: :|h: = : : II 

Db 7 LLRWMAALLCARNALGSTMEEELVAPVRLEA VTAQAESGEEKIRYHLNVFGE 58 

Qy 63 DFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNAEPDSFAAV 117 

I I II III II :::| I : llh ::|hlllll II : I Ih 

Db 59 SLTLELEPDPSFLASDFSLQYVGFPAE-RTGGAAAEPLAELRKCFYSGTVNGDSSSSAAL 117 

Qy 118 SLCGGLRGAFGYRGAEYVISPLPNASAPAAQRNSQG AHLLQRRGVPGGPSGDPTSRC 174 

III llllli III Ihll I : I I II:: II ::| 

Db 118 SLCSGLRGAFFYRGEEYLIQP-SNHTLTVADPKSPDPLVLHLVRRR SRAKAKC 169 

Qy 175 GVASGWNPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFH 234 

II : I : I :h Mill I I I I I I : : I I I : M =11 

Db 170 GVTDPVEQPVKSEDTPTQRTMPPTGSGSTRK KRFVSSPRYVETMLVADQSMAEFH 224 

Qy 235 GADLEHYLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNFCA 294 

|: hlllllh: l|:||:|lll I h:|llh|:: I : II h IIIIMMII: 
Db 225 GSSLKHYLLTLMSVAAKLYKHPSIRNSISLVWKILVIYDEEKGPDVSSNAALTLRNFCS 284 

Qy 295 WQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLP 354 

III: I Ihl |::|IMIIIIIIIIII I I I I II I I t I II : I I I lllhllllli 
Db 285 WQKQHNPPSDRHAEHYDTAILFTRQDLCGAQTCDTLGMADVGTICDPTRSCSIIEDDGLQ 344 

Qy 355 SAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDF 414 

Hllhllllllllllll I I : I : :Mh I \ \ Ml III Ml I 
Db 345 AAFTTSHELGHVFNMPHDEAKQCSGLNGANQDSHMMASMLSNLDRYEPWSPCSAYMITSF 404 

Qy 415 LDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCP-YMQYCTKLWCTGK 473 

Ihllhlllhl :|l II nil I ::||: II h II M Mill 

Db 405 LDNGHGECLLDKPHRPIQLPSDLPGTLYDANKQCQFTFGDESRHCPDAASTCTTLWCTGV 464 

Qy 474 AKGQMVCQTRHFPWADGTSCGEGKLCLKGACVERHNLNKH RVDGSWAKWDPYGPCSR 530 

: I Mllhllllllllllllll |: I |::: : II I I I I M III 

Db 465 SGGLLVCQTKHFPWADGTSCGEGKWCINGKCMDQTD-KKHYETPVHGGWGTWGQWGECSR 523 

Qy 531 TCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGY 590 

II MM : |:| M I IIIIMII MM: MM: || : MMMIIIMI I 
Db 524 TCDGGVQYSIRECDNPVPKNGGKYCEGKRVQYQSCNIQDCPEN-NGKTYREEQCEAHNDL 582 

Qy 591 NHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCV 650 

: I II I IIMIMMhIIIMII IIIIIMI IIMIII IMIIIMM 

Db 583 SKSPFGSGPAVEWTPKYAGVSPKDRCKLMCRAKGTGYFFVLNPKWDGTPCSPDSTSMCI 642 

Qy 651 QGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSID 710 

IhhIIIII :|l Mlllllllll: Mllhl I II II II l|::h 

Db 643 QGQCVKAGCDRVIGSSKKFDKCGVCGGNGSTCKKVSGNFVISRPGYQDWTIPTGATNIE 702 



Qy 711 IRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQ 770 

::|| :| I ::||:| : I hill : :| : \ \ • hHlllh : = [ 
Db 703 VKQRNNRGSRHDGSFLAIKAADGTYILNGDYTLSTLEQDITHNGNVLRYSGSSASLERIR 762 

Qy 771 ASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSN 830 

: |: l|:|::||:|| :::| ::: j : :| : h : : -11 
Db 763 SFSPLKEPITIQVLTVGDSHRLKIKYVYFVKKTGQPEKPNKKKE— SFNAIRETILS 817 

Qy 831 QVEQPDDRPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAH--RPVETQ 888 

II II II :|l I |:| hh II |: Ih U : : 
Db 818 EWVIEEWGECSKTCGLGWQRRKVECKDINGQ PSMDCANELKPDDIR 863 

Qy 889 ACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQEL-DFCV 946 

I : III hi II llhlhlhU MM : I : h Mh I III 

Db 864 PCADTPCPQWQLGDWSSCSKTCGKGFKKRLLKCVSYDGVNMPQENCDSLKKPKHLIDFCT 923 

Qy 947 LRPC 950 



Db 924 LANC 927 



RESULT 13 
Q8NE26_HUMAN 

ID Q8NE26_HUMAN PRELIMINARY; PRT; 967 AA. 

AC Q8NE26; 

DT 01-OCT-2002, integrated into UniProtKB/TrEMBL.

DT 01-OCT-2002, sequence version 1.

DT 07-FEB-2006, entry version 18.

DE ADAM metallopeptidase with thrombospondin type 1 motif, 1,

DE preproprotein.

GN Name=ADAMTS1;

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; 

OC Homo . 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Testis; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Testis; 

RG NIH MGC Project; 

RL Submitted (AUG-2002) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC036515; AAH36515.1; -; mRNA. 



DR HSSP; P07996; 1LSL.

DR Ensembl; ENSG00000154734; Homo sapiens.

DR GO; GO:0031012; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0006508; P:proteolysis; IEA.

DR InterPro; IPR006586; ADAM_cysteine.

DR InterPro; IPR010294; ADAM_spacer1.

DR InterPro; IPR013273; ADAM_TS.

DR InterPro; IPR013274; ADAM_TS1.

DR InterPro; IPR001128; Cytochrome_P450.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR001590; Peptidase_M12B.

DR InterPro; IPR002870; Peptidase_M12B_N.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR Pfam; PF05986; ADAM_spacer1; 1.

DR Pfam; PF01562; Pep_M12B_propep; 1.

DR Pfam; PF01421; Reprolysin; 1.

DR Pfam; PF00090; TSP_1; 3.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00608; ACR; 1.

DR SMART; SM00209; TSP1; 3.

DR PROSITE; PS50215; ADAM_MEPRO; 1.

DR PROSITE; PS00086; CYTOCHROME_P450; UNKNOWN_1.

DR PROSITE; PS50092; TSP1; 3.

DR PROSITE; PS00142; ZINC_PROTEASE; UNKNOWN_1.

SQ SEQUENCE 967 AA; 105388 MW; FF1D399674201C3D CRC64; 



Query Match 47.9%; Score 2472.5; DB 2; Length 967; 

Best Local Similarity 48.8%; Pred. No. 1.9e-170; 

Matches 484; Conservative 154; Mismatches 252; Indels 101; Gaps 24; 

Qy 1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60 

:||| II : I : hill h III II 
Db 36 LLLLAAALLAVSDALGRPSEEDEELWP-ELE RAP GHGTTRLRLHAF 81 

Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSF 114 

: I I II: nil h :::| II HI Mill II :| I 

Db 82 DQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLP ETDLAHCFYSGTVNGDPSSA 136 

Qy 115 AAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQ--RRGVPGG 165 

Ihlll hllll I I I III II Ml I III: I:| II 

Db 137 AALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRRNRQGDVGG 196 

Qy 166 PSG DPTSRC GVASG--WNPAILRALDPYKPRRAGFGESRSRRRS 207 

I II : III: I I Ih : I 

Db 197 TCGWDDEPRPTGKAETEDEDEGTEGEDEGPQWS PQDPALQGVGQP-TGTGS 247 

Qy 208 GRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVW 267 

I Mill llll|::|lhll :|l|: hllllM : llllhllll I :::|ll 
Db 248 IRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVW 307 

Qy 268 KVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTC 327 

|:|:: I Ihll llllllllll llh I Ih h : I I I I I I M I I II h II 
Db 308 KILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTC 367 

Qy 328 DTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRAN 387 

lllllllllhlll IMIIIMIII :|Mlllllllllllllh I I : I : : 
Db 368 DTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNGVNQDS 427 

Qy 388 HMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQ 447 

llh I :| : III III :|l Mhllhlhhl II M MM M ::| 
Db 428 HMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSYDANRQ 487 

Qy 448 CELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVE 506 

h II II II h Mill : I :|llhllllllllllllll h I II 

Db 488 CQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCINGKCVN 547 

Qy 507 RHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYR 563 

: : II III I hi llllllllll hi II I IIIIIMI Ihll 

Db 548 KTD-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYR 606 



Qy 564 SCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRAN 623 

Hill II : :||:|lllllll I := := II I : I I I : I I I h h I I I I I = I 

Db 607 SCNLEDCPDN-NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKLICQAK 665 



Qy 624 GTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCK 683 

I llhll lllllll II llllllllhhillll : llhlllllllll: :|l 

Db 666 GIGYFFVLQPKWDGTPCSTDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCK 725 

Qy 684 KVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFW 743 

|::| I It: =: II IhH-UI :| = -Ihl : I hill : 

Db 726 KISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILNGDYTL 785 

Qy 744 SAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKE 803 

I :|:|:: II :|l|l|: hi h lllhHhIl |:::|:::: h 

Db 786 STLEQDIMYKGWLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTYFVKKK 845 

Qy 804 PREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRA 863 

II h: : I II II II II I hi 

Db 846 KES FNAIPTFS AWVIEEWGECSKSCELGWQRRL 878 

Qy 864 VDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKC 920 

hll II II : I :| h I : III hi II I I h I h h : : I I I I I 
Db 879 VECRDINGQ PASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKKRSLKC 935 

Qy 921 VGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

: I I :h : h :|h Hll : I 

Db 936 LSHDGGVLSHESCDPLKKPKHFIDFCTMAEC 966 
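
The statistics printed for Result 13 above (Matches 484; Conservative 154; Mismatches 252; Indels 101; Best Local Similarity 48.8%) are consistent with the similarity figure being the number of matches divided by the total number of alignment columns. The following Python sketch checks that relationship against the values listed in this report. The function name and the rounding to one decimal place are assumptions made for illustration; they are not taken from the GenCore documentation.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are exact matches.

    Assumption: the total column count is the sum of the four
    counts printed on the 'Matches ...' line of each result.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Values copied from the Result 13, 14 and 15 statistics lines.
print(best_local_similarity(484, 154, 252, 101))
print(best_local_similarity(477, 161, 256, 92))
print(best_local_similarity(477, 160, 255, 96))
```

Under this assumption the three calls reproduce the 48.8%, 48.4% and 48.3% values reported for Results 13, 14 and 15.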

RESULT 14 
ATS1_RAT 

ID ATS1_RAT STANDARD; PRT; 967 AA. 

AC Q9WUQ1; Q9ERI1; 

DT 01-DEC-2000, integrated into UniProtKB/Swiss-Prot.

DT 01-NOV-1999, sequence version 1.

DT 07-MAR-2006, entry version 57.

DE ADAMTS-1 precursor (EC 3.4.24.-) (A disintegrin and metalloproteinase

DE with thrombospondin motifs 1) (ADAM-TS 1) (ADAM-TS1).

GN Name=Adamts1;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;

OC Muroidea; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP NUCLEOTIDE SEQUENCE [MRNA].

RC STRAIN=Sprague-Dawley; TISSUE=Brain;

RA Liu X., Tu Y., Yin T., Johnstone E.M., Stephenson D.T., Clemens J.A.,

RA Little S.P.;

RT "Induction of a disintegrin and metalloprotease with the

RT thrombospondin type I motif (ADAMTS).";

RL Submitted (MAY-1999) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP NUCLEOTIDE SEQUENCE [MRNA] OF 18-967.

RC STRAIN=Sprague-Dawley; TISSUE=Liver;

RX MEDLINE=20304099; PubMed=10847486;

RX DOI=10.1034/j.1600-0676.2000.020002165.x;

RA Diamantis I., Luethi M., Hoesli M., Reichen J.;

RT "Cloning of the rat ADAMTS-1 gene and its down regulation in

RT endothelial cells in cirrhotic rats.";

RL Liver 20:165-172(2000).

CC -!- FUNCTION: Cleaves aggrecan, a cartilage proteoglycan, and may be
CC involved in its turnover. Has angiogenic inhibitor activity (By
CC similarity). Active metalloprotease, which may be associated with
CC various inflammatory processes as well as development of cancer
CC cachexia. May play a critical role in follicular rupture (By
CC similarity).
CC -!- CATALYTIC ACTIVITY: Cleaves aggrecan at the 1683-Glu-|-Leu-1684
CC site, within the chondroitin sulfate attachment domain.
CC -!- COFACTOR: Binds 1 zinc ion per subunit (By similarity).
CC -!- SUBCELLULAR LOCATION: Secreted protein; extracellular space;
CC extracellular matrix (By similarity).
CC -!- INDUCTION: Down-regulated in endothelial cells derived from
CC cirrhotic liver.
CC -!- DOMAIN: The Spacer domain and the TSP type-1 domains are important
CC for a tight interaction with the extracellular matrix.
CC -!- PTM: The precursor is cleaved by a furin endopeptidase (By
CC similarity).
CC -!- SIMILARITY: Contains 1 disintegrin domain.
CC -!- SIMILARITY: Contains 1 peptidase M12B domain.
CC -!- SIMILARITY: Contains 3 TSP type-1 domains.
CC
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR EMBL; AF149118; AAD34012.1; -; mRNA.


DR EMBL; AF304446; AAG29823.1; -; mRNA.
DR HSSP; P07996; 1LSL.
DR MEROPS; M12.222;
DR Ensembl; ENSRNOG00000001607; Rattus norvegicus.
DR RGD; 621241; Adamts1.
DR InterPro; IPR006586; ADAM_cysteine.
DR InterPro; IPR010294; ADAM_spacer1.
DR InterPro; IPR013273; ADAM_TS.
DR InterPro; IPR013274; ADAM_TS1.
DR InterPro; IPR001762; Disintegrin.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR001590; Peptidase_M12B.
DR InterPro; IPR002870; Peptidase_M12B_N.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR Pfam; PF05986; ADAM_spacer1; 1.
DR Pfam; PF01562; Pep_M12B_propep; 1.
DR Pfam; PF01421; Reprolysin; 1.
DR Pfam; PF00090; TSP_1; 3.
DR PRINTS; PR01858; ADAMTS1.
DR PRINTS; PR01857; ADAMTSFAMILY.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00608; ACR; 1.
DR SMART; SM00209; TSP1; 3.
DR PROSITE; PS50215; ADAM_MEPRO; 1.
DR PROSITE; PS00427; DISINTEGRIN_1; FALSE_NEG.
DR PROSITE; PS50214; DISINTEGRIN_2; FALSE_NEG.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS00142; ZINC_PROTEASE; 1.
KW Extracellular matrix; Glycoprotein; Heparin-binding; Hydrolase;
KW Metal-binding; Metalloprotease; Protease; Repeat; Signal; Zinc;
KW Zymogen.






FT SIGNAL        1     54   Potential.
FT PROPEP       55    252   By similarity.
FT                          /FTId=PRO_0000029154.
FT CHAIN       253    967   ADAMTS-1.
FT                          /FTId=PRO_0000029155.
FT DOMAIN      258    467   Peptidase M12B.
FT DOMAIN      476    558   Disintegrin.
FT DOMAIN      559    614   TSP type-1 1.
FT DOMAIN      854    910   TSP type-1 2.
FT DOMAIN      911    967   TSP type-1 3.
FT REGION      725    857   Spacer.
FT COMPBIAS    194    198   Poly-Arg.
FT COMPBIAS    616    724   Cys-rich.
FT ACT_SITE    402    402   By similarity.
FT METAL       401    401   Zinc (catalytic) (By similarity).
FT METAL       405    405   Zinc (catalytic) (By similarity).
FT METAL       411    411   Zinc (catalytic) (By similarity).
FT SITE        205    205   Cysteine switch (Potential).
FT CARBOHYD    547    547   N-linked (GlcNAc...) (Potential).
FT CARBOHYD    720    720   N-linked (GlcNAc...) (Potential).
FT CARBOHYD    764    764   N-linked (GlcNAc...) (Potential).
FT CARBOHYD    782    782   N-linked (GlcNAc...) (Potential).
FT CARBOHYD    945    945   N-linked (GlcNAc...) (Potential).
FT DISULFID    379    462   By similarity.
FT DISULFID    417    446   By similarity.
FT DISULFID    571    608   By similarity.
FT DISULFID    575    613   By similarity.
FT DISULFID    586    598   By similarity.
FT CONFLICT     21     21   I -> V (in Ref. 2).
FT CONFLICT     26     31   KFRSSQ -> RSRGSL (in Ref. 2).
FT CONFLICT     49     49   V -> A (in Ref. 2).
FT CONFLICT     72     72   R -> P (in Ref. 2).
FT CONFLICT     79     79   L -> TR (in Ref. 2).
FT CONFLICT    249    249   R -> G (in Ref. 2).
FT CONFLICT    262    265   TMLV -> NLLK (in Ref. 2).
FT CONFLICT    607    607   S -> F (in Ref. 2).
FT CONFLICT    936    936   L -> V (in Ref. 2).
FT CONFLICT    962    962   I -> T (in Ref. 2).


SQ SEQUENCE 967 AA; 105706 MW; F93C864F6DCDB4CF CRC64;

Query Match 47.9%; Score 2471; DB 1; Length 967;



Best Local Similarity 48.4%; Pred. No. 2.4e-170; 

Matches 477; Conservative 161; Mismatches 256; Indels 92; Gaps 22; 



Qy 1 MLLLGILTLAFAGRTAGG— FEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQIT 58 

:||| :|: I I I I : hh I : I = I : =: 

Db 37 LLLLASITMLLCVRGAHGRPTEEDEELVL PSLERARGH DSTTLLRLD 83 

Qy 59 AFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNAE 110 

II : :M II: Mil h : : I : II II Hill II : 

Db 84 AFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNGD 138 

Qy 111 PDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLL--QRR 160 

I I Ihlll hllll :||: I I 1 : II : I hi W 

Db 139 PSSAAALSLCEGVRGAFYLQGEEFFIQPAPAVATERLVPAEPKEESIAPPRFHILRRRRR 198 

Qy 161 GVPGGPSGD PTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRRSGRAKRFV 214 

III III I I I :| I: II : I I Mil 

Db 199 GSGGAKCGVMDEETLPTSNSGRESQNTPDQWPLRNP-TPQGAG KPTGPGSIRKKRFV 254 

Qy 215 SIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVWKVLLLRD 274 

I ||||||::|||:|| llh hllllll = III hllll I |::|ll|:|:: : 
Db 255 SSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSISLVWKILVIYE 314 

Qy 275 RDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCDTLGMAD 334 

||:|| lllllllllhllj: I II: I I :: I I I I I I I I I I I I I : lllllllll 
Db 315 EQKGPEVTSNAALTLRNFCSWQKQHNSPSDRDPEHYDTAILFTRQDLCGSHTCDTLGMAD 374 

Qy 335 VGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTL 394 

llhlll IIIIIIIMII :|lllllllllllllil|: I I I H:|: I 

Db 375 VGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASFNGVSGDSHLMASML 434 

Qy 395 IQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQCELAFGV 454 

:| : III III ::| ilhllhlhhl II II Mil I : = lh II 
Db 435 SSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLYDANRQCQFTFGE 494 

Qy 455 GSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVERHNLNKH 513 

I II |: Mill : I :||||:|lllllllllllll hill: := II 

Db 495 ESTHCPDAASTCSTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVSGKCVNKTDM-KH 553 

Qy 514 RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPC 570 

I III I hi llllllllll hi II I llllllll Ihllllhl I 
Db 554 FATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYRSCNIEDC 613 

Qy 571 PSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYV 630 

I : :|hllllllll I :: :: II llhlllhhill I I I llhl 

Db 614 PDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKLTCEAKGIGYFFV 672 

Qy 631 LAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFT 690 

I lllllll lllllllllllhhillll : llhllllllllh :|lh:| I 
Db 673 LQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCKKISGTVT 732 

Qy 691 KPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDL 750 

Ih :| llllh:!::: I =1 : ::|h: : I hllhl :| :hll 



Db 733 STRPGYHDIVTIPAGATNIEVKHRNPRGSRNNGSFLAIRAADGTYILNGNFTLSTLEQDL 792 



Qy 751 VVKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPK--EPREDK 808 

l|::||li|: hi h lllhHI II |:::|:::: | || 
Db 793 TYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKYTYFMKKKTEP 848 



Qy 809 SSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRAVDCRG 868 

]:: : I II II II Mill hi hll 

Db 849 FNAIPTFS EWVIEEWGECSKTCGSGWQRRWECRD 883 



Qy 869 SAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKCVGHGG 925 

I II : I :| h I : III h: I I I I I h I h h : : h II h I I 

Db 884 INCH PASECAKEVKPASTRPCADLPCPRWQVGDWSPCSKTCGKGYKKRTLKCLSHDG 940 



Qy 926 RLLARDQCNLHRKPQE-LDFCVLRPC 950 

:|: : h :|h :||hl I 
Db 941 GVLSNESCDPLKKPKHYIDFCILTQC 966 



RESULT 15 
Q68EJ2_RAT 

ID Q68EJ2_RAT PRELIMINARY; PRT; 967 AA. 

AC Q68EJ2; 

DT 11-OCT-2004, integrated into UniProtKB/TrEMBL.

DT 11-OCT-2004, sequence version 1.

DT 07-FEB-2006, entry version 11.

DE A disintegrin and metalloproteinase with thrombospondin motifs 1.

GN Name=Adamts1;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;

OC Muroidea; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Lung; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [2]

RP NUCLEOTIDE SEQUENCE.

RC TISSUE=Lung;

RA Director MGC Project;

RL Submitted (AUG-2004) to the EMBL/GenBank/DDBJ databases.

CC

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC080237; AAH80237.1; -; mRNA. 

DR GO; GO:0031012; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0008270; F:zinc ion binding; IEA.

DR GO; GO:0007229; P:integrin-mediated signaling pathway; IEA.

DR GO; GO:0006508; P:proteolysis; IEA.

DR InterPro; IPR006586; ADAM_cysteine.

DR InterPro; IPR010294; ADAM_spacer1.

DR InterPro; IPR013273; ADAM_TS.

DR InterPro; IPR013274; ADAM_TS1.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR001590; Peptidase_M12B.

DR InterPro; IPR002870; Peptidase_M12B_N.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR Pfam; PF05986; ADAM_spacer1; 1.

DR Pfam; PF01562; Pep_M12B_propep; 1.

DR Pfam; PF01421; Reprolysin; 1.

DR Pfam; PF00090; TSP_1; 3.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00608; ACR; 1.

DR SMART; SM00209; TSP1; 3.

DR PROSITE; PS50215; ADAM_MEPRO; 1.

DR PROSITE; PS50092; TSP1; 3.

DR PROSITE; PS00142; ZINC_PROTEASE; 1. 

KW Integrin. 

SQ SEQUENCE 967 AA; 105648 MW; C6349B5D8CBFEA24 CRC64; 

Query Match 47.8%; Score 2466; DB 2; Length 967; 

Best Local Similarity 48.3%; Pred. No. 5.6e-170; 

Matches 477; Conservative 160; Mismatches 255; Indels 96; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGG--FEPEREVWPIRLDPDINGRRYYWRGPEDSGDQG--LIFQ 56 

:|: III I : hhl M I : : 

Db 37 LLLLASITMLLCVRGAHGRPTEEDEELVLP SLERARGHGSTTLLR 81 

Qy 57 ITAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVN 108 

: II : :| I Ih Mil h : : I : II II Mill M 

Db 82 LDAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVN 136 

Qy 109 AEPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLL--Q 158 

:| I Ihlll hllll :| |: I II : II : I hi : 

Db 137 GDPSSAAALSLCEGVRGAFYLQGEEFFIQPAPAVATERLVPAEPKEESIAPPRFHILRRR 196 

Qy 159 RRGVPGGPSGD PTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRRSGRAKR 212 

III II III I I I :| h II : I I II 

Db 197 RRGSGGAKCGVMDEETLPTSNSGRESQNTPDQWPLRNP-TPQGAG KPTGPGSIRKKR 252 

Qy 213 FVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVWKVLLL 272 

III IllllhUlhll llh hllllll : III hllll I |::|ll|:|:: 
Db 253 FVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSISLVWKILVI 312 

Qy 273 RDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCDTLGM 332 

l|:|| lllllllllhllh I Ih l|::|lllllllllllh lllllll 
Db 313 YEEQKGPEVTSNAALTLRNFCSWQKQHNSPSDRDPEHYDTAILFTRQDLCGSHTCDTLGM 372 

Qy 333 ADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANHMMSP 392 

llllhill lllllllllll :|IIIIIIIIIIIIIM|: I I I :|:|: 

Db 373 ADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASFNGVSGDSHLMAS 432 

Qy 393 TLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQCELAF 452 

I :| : III III ::| I II = I I I : I I : I : I II II MM I ::||: I 
Db 433 MLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLYDANRQCQFTF 492 

Qy 453 GVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVERHNLN 511 

I I II |: Mill : I Mllhllllllllllllll |: I II : =: 

Db 493 GEESTHCPDAASTCSTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVSGKCVNKTDM- 551 

Qy 512 KH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLE 568 

M I III I hi llllllllll |:| II I llllllll Ihlllll:! 

Db 552 KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYRSCNIE 611 



Qy 569 PCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYF 628 

II : :||:|MMIM I =: II llhlllMhIII I I I Ml 

Db 612 DCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKLTCEAKGIGYF 670 



Qy 629 YVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGL 688 

:|l lllllll lllllllllllhhillll : llhllllllllh 
Db 671 FVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCKKISGT 730 

Qy 689 FTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVER 748 

I II: :| ll|l|::|::: I =1 : -ll- = I hllhl H =1 = 

Db 731 VTSTRPGYHDIVTIPAGATNIEVKHRNPRGSRNNGSFLAIRAADGTYILNGNFTLSTLEQ 790 

Qy 749 DLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPK--EPRE 806 

II l|::||ll|: hi h lll|::|l II |:::|:::: I II 

Db 791 DLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKYTYFMKKKTEP-- 848 

Qy 807 DKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRAVDC 866 

|:: : I II II II HIM hi hi 

Db 849 FNAIPTFS EWVIEEWGECSKTCGSGWQRRWEC 881 

Qy 867 RGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKCVGH 923 

I I II : I :| h I : INI:: I I I II h I h h : : h I I h I 

Db 882 RDINGH PASECAKEVKPASTRPCADLPCPRWQVGDWSPCSKTCGKGYKKRTLKCLSH 938 

Qy 924 GGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

I :|: : h Hh Ulhl I 
Db 939 DGGVLSNESCDPLKKPKHYIDFCILTQC 966 
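
Each result in this report carries semicolon-separated statistics lines such as `Query Match 47.8%; Score 2466; DB 2; Length 967;`. A minimal Python sketch for splitting such a line into named fields is shown below. The function name `parse_stats` and the regular expression are illustrative assumptions, not part of GenCore, and the sketch assumes every field ends in a single numeric token.

```python
import re


def parse_stats(line: str) -> dict:
    """Split 'Name value; Name value;' fields into a dict of strings.

    Assumption: each field is a label followed by one numeric token
    (digits, sign, decimal point, exponent, or a trailing percent sign).
    """
    stats = {}
    for field in line.split(";"):
        field = field.strip()
        if not field:
            continue
        m = re.match(r"^(.*?)\s+([-+0-9.eE%]+)$", field)
        if m:
            stats[m.group(1)] = m.group(2)
    return stats


print(parse_stats("Query Match 47.8%; Score 2466; DB 2; Length 967;"))
print(parse_stats("Best Local Similarity 48.3%; Pred. No. 5.6e-170;"))
```

Values are kept as strings so that the caller can decide how to treat percentages and very small Pred. No. values.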



Search completed: August 5, 2006, 00:13:48 
Job time : 323 secs 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: August 5, 2006, 00:04:34 ; Search time 198 Seconds 

(without alignments) 
2193.715 Million cell updates/sec 

Title: US-10-763-210-1 
Perfect score: 5164 

Sequence: 1 MLLLGILTLAFAGRTAGGFE DQCNLHRKPQELDFCVLRPC 950 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2589679 seqs, 457216429 residues 

Total number of hits satisfying chosen parameters: 2589679 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : A_Geneseq_8:* 

1: geneseqp1980s:* 

2: geneseqp1990s:* 

3: geneseqp2000s:* 

4: geneseqp2001s:* 

5: geneseqp2002s:* 

6: geneseqp2003as:* 

7: geneseqp2003bs:* 

8: geneseqp2004s:* 

9: geneseqp2005s:* 

10: geneseqp2006s:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



                   %
Result           Query
  No.    Score   Match  Length  DB  ID        Description
    1     5164   100.0     950   4  AAG62299  Aag62299 Human met
    2     5156    99.8     950   5  AAE22541  Aae22541 Human pro
    3     5156    99.8     950   6  ABU09520  Abu09520 Human pro
    4     5156    99.8     950   8  ADQ88214  Adq88214 Human 655
    5     5149    99.7     950   6  ABR40092  Abr40092 Human ADA
    6     5111    99.0     952   5  AAU74751  Aau74751 Human pro
    7     4915    95.2     924   5  ABP70062  Abp70062 Human NOV
    8   4856.5    94.0     928   5  AAU72899  Aau72899 Human met
    9   4242.5    82.2     823   6  ABU08383  Abu08383 Human mat
   10   3957.5    76.6     755   5  ABP70063  Abp70063 Human NOV
   11   2497.5    48.4     505   3  AAB21257  Aab21257 Rat metal
   12   2482.5    48.1     968   6  ABU08387  Abu08387 Murine ma
   13   2480.5    48.0     949   7  ADG72483  Adg72483 Human agg
   14   2480.5    48.0     949   9  ADZ21014  Adz21014 Human agg
   15   2480.5    48.0     950   2  AAY49501  Aay49501 Human MET
   16   2480.5    48.0     950   4  AAB73549  Aab73549 Human ADA
   17   2480.5    48.0     950   4  AAB50002  Aab50002 Human MET
   18   2480.5    48.0     950   8  ADO20218  Ado20218 Human PRO
   19   2480.5    48.0     950   8  ADQ39942  Adq39942 Human myo
   20   2480.5    48.0     950   9  ADZ21015  Adz21015 Human agg
   21   2480.5    48.0     950   9  AEC01581  Aec01581 Human ADA
   22   2480.5    48.0     950   9  AED07146  Aed07146 Respirato
   23   2480.5    48.0     967   2  AAW80285  Aaw80285 Human int
   24   2480.5    48.0     967   8  ADQ39940  Adq39940 Human myo
   25   2480.5    48.0     967   8  ADQ39941  Adq39941 Human myo
   26   2480.5    48.0     967   9  ADY54944  Ady54944 Chronic v
   27   2480.5    48.0     968   4  AAB50011  Aab50011 Protein;
   28   2480.5    48.0     999   9  AED74423  Aed74423 Human pla
   29   2479.5    48.0     967   2  AAY04142  Aay04142 Human Tan
   30   2476.5    48.0     967   8  ADR14133  Adr14133 Human NF-
   31   2458.5    47.6     967   2  AAW78189  Aaw78189 Human sec
   32   2458.5    47.6     967   6  ADA57139  Ada57139 Human sec
   33   2458.5    47.6     967   6  ADA41003  Ada41003 Human sec
   34   2458.5    47.6     967   7  ADB91631  Adb91631 Human sec
   35   2458.5    47.6     967   7  ADC74267  Adc74267 Human sec
   36   2458.5    47.6     967   7  ADD37948  Add37948 Human sec
   37   2445.5    47.4     950   3  AAY53899  Aay53899 Amino aci
   38     2322    45.0     896   3  AAB21265  Aab21265 Mouse met
   39     2274    44.0     727   2  AAW78435  Aaw78435 Human ADA
   40   2136.5    41.4     890   8  ADK70513  Adk70513 Respirato
   41   2135.5    41.4     890   2  AAY49502  Aay49502 Human MET
   42   2135.5    41.4     890   4  AAB50003  Aab50003 Human MET
   43   2134.5    41.3     924   8  ADX68264  Adx68264 Plant ful
   44   2130.5    41.3     890   6  ABP96306  Abp96306 Human ADA
   45   2130.5    41.3     890   9  AED90232  Aed90232 Human ADA
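
The Query Match values in the summaries appear to be each hit's score expressed as a percentage of the perfect score (5164) given in the search header. The Python sketch below checks that reading against a few of the listed rows. The function name and the one-decimal rounding are assumptions made for illustration and are not taken from the GenCore documentation.

```python
PERFECT_SCORE = 5164  # "Perfect score" from the search header


def query_match_percent(score: float, perfect: float = PERFECT_SCORE) -> float:
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect, 1)


# Scores copied from summary rows 1, 2, 7, 13 and 39.
for score in (5164, 5156, 4915, 2480.5, 2274):
    print(score, query_match_percent(score))
```

Under this assumption the computed values agree with the Query Match column for those rows (100.0, 99.8, 95.2, 48.0 and 44.0).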



ALIGNMENTS 



RESULT 1 
AAG62299 

ID AAG62299 standard; protein; 950 AA. 
XX 

AC AAG62299; 
XX 

DT 23-AUG-2001 (first entry) 
XX 

DE Human metalloprotease MDTS6 protein. 
XX 

KW Metalloprotease; human; aggrecanase; joint disease; osteoarthritis; 

KW osteopathic; antiarthritic. 

XX 

OS Homo sapiens. 
XX 

PN WO200134785-A1. 
XX 

PD 17-MAY-2001. 
XX 

PF 10-NOV-2000; 2000WO-JP007917. 
XX 

PR 11-NOV-1999; 99JP-00321740. 

PR 16-MAY-2000; 2000JP-00144020. 
XX 

PA (YAMA ) YAMANOUCHI PHARM CO LTD. 

PA (KAZU-) KAZUSA DNA RES INST. 
XX 

PI Yamaji N, Nishimura K, Abe K, Ohara O, Nagase T, Nomura N; 
XX 

DR WPI; 2001-343602/36. 

DR N-PSDB; AAH41003. 
XX 

PT Metalloprotease with aggrecanase activity for treating joint diseases 

PT especially osteoarthritis. 

XX 

PS Claim 1; Page 56-60; 85pp; Japanese. 
XX 

CC This invention relates to a metalloprotease with aggrecanase activity. 

CC The invention includes protein and DNA sequences of the metalloprotease, 

CC vectors containing the DNA, host cells transformed by the vectors, and 

CC antibodies directed against the metalloprotease. The antibodies, protein 

CC and DNA sequences can be used in the treatment and prevention of joint 






CC diseases, particularly osteoarthritis. The treatment may result in 

CC osteopathic and antiarthritic activity. The present sequence represents 

CC the metalloprotease of the invention termed MDTS6 

XX 

SQ Sequence 950 AA; 



Query Match 100.0%; Score 5164; DB 4; Length 950; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 950; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MLLLGILTLAFAGRTAGGFEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950

RESULT 2 
AAE22541 

ID AAE22541 standard; protein; 950 AA. 
XX 

AC AAE22541; 
XX 

DT 26-JUL-2002 (first entry) 
XX 

DE Human protease #2. 
XX 

KW Human; novel human protein; NHP; protease; biological disorder; obesity; 

KW high blood pressure; arthritis; connective tissue disorder; infertility; 

KW gene therapy; enzyme. 
XX 

OS Homo sapiens. 
XX 

PN WO200226949-A2. 
XX 

PD 04-APR-2002. 
XX 

PF 27-SEP-2001; 2001WO-US030350.
XX 

PR 29-SEP-2000; 2000US-0236689P. 
XX 

PA (LEXI-) LEXICON GENETICS INC.
XX 

PI Friddle CJ, Hilbun E; 
XX 

DR WPI; 2002-372123/40. 

DR N-PSDB; AAD35569. 
XX 

PT Novel nucleic acid encoding a human protease, useful as a hybridization 

PT probe for screening libraries and assessing gene expression patterns. 
XX 

PS Claim 6; Page 36-38; 41pp; English. 
XX 

CC The present sequence is novel human protein (NHP), human protease. NHPs 

CC share structural similarity with animal proteases particularly zinc 

CC metalloproteases. Sequences of the invention are useful in therapeutic,

CC diagnostic and pharmacogenomic applications. NHP polynucleotides are used 

CC as hybridisation probes for screening libraries and assessing gene 

CC expression patterns. They can also be used for treating related 

CC biological disorders such as obesity, high blood pressure, arthritis, 

CC connective tissue disorders and infertility. They are also used in gene 

CC therapy 

XX 

SQ Sequence 950 AA; 

Query Match 99.8%; Score 5156; DB 5; Length 950; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 MLLLGILTLAFAGRTAGGFEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950





RESULT 3 


ABU09520

ID ABU09520 standard; protein; 950 AA.
XX

AC ABU09520;
XX

DT 30-JUN-2003 (first entry)
XX

DE Human protease of the metalloprotease family.
XX

KW Human; SNP; chromosome 11; protease; metalloprotease; cancer;

KW disintegrin; metalloprotease with thrombospondin motifs-1; cachexia;

KW cytostatic; immunomodulator; single nucleotide polymorphism.
XX

OS Homo sapiens.
XX

FH Key Location/Qualifiers

FT Misc-difference 220

FT /note= "Encoded by GCG in the cDNA appearing as

FT ABX95684"

FT Misc-difference 566

FT /note= "May be Lys as the result of a single nucleotide

FT polymorphism"

FT Misc-difference 623

FT /note= "Encoded by AGT (cDNA) or ATG (gene)"
XX 

PN US2002086400-A1. 
XX 

PD 04-JUL-2002. 
XX 

PF 21-DEC-2000; 2000US-00741151.
XX 

PR 06-DEC-2000; 2000US-0251398P. 
XX 

PA (ZHUS/) ZHU S. 

PA (GUEG/) GUEGLER K. 

PA (WEBS/) WEBSTER M. 

PA (DFRA/) DI FRANCESCO V. 

PA (BEAS/) BEASLEY E M. 

XX 

PI Zhu S, Guegler K, Webster M, Di Francesco V, Beasley EM;
XX 

DR WPI; 2003-401333/38. 

DR N-PSDB; ABX95684, ABX95685. 

XX 

PT Novel human protease protein related to metalloprotease subfamily and 

PT nucleic acid molecule encoding the protein for diagnosing, treating 

PT disease or condition mediated by the protease protein e.g. cancer, 

PT cachexia. 
XX 

PS Claim 1; Fig 2; 64pp; English. 
XX 

CC The invention relates to an isolated human protease peptide (a member of 

CC the disintegrin/metalloprotease with thrombospondin motifs-1 family), an

CC allelic variant or orthologue encoded by a nucleic acid molecule that 

CC hybridises under stringent conditions to the opposite strand of the cDNA 

CC and gene appearing as (ABX95684 and ABX95685) or a fragment of having 10 

CC contiguous amino acids. Also included are the encoding nucleic acids 

CC (including their complements, allelic variants, orthologues or 

CC fragments), an isolated antibody that selectively binds to the protease, 

CC a gene chip comprising the nucleic acids, a transgenic non-human animal 

CC comprising the nucleic acids, vectors and host cells producing the 

CC protein. The protease is useful for identifying a modulator of the 

CC expression of the protease or binds to it. The protease and the nucleic 

CC acids are useful for treating, preventing and/or diagnosing disorders 

CC such as cancer and cachexia. The protease and the nucleic acids are 

CC further useful as a query sequence to perform a search against sequence 

CC databases to identify other family members or related sequences. The 

CC nucleic acids are useful as primers and probes for e.g. in situ 

CC hybridisation or chromosomal localisation, and for the synthesis of 

CC antisense molecules. The antibody is useful to isolate, purify and detect

CC the presence of the protease in cells or tissues. The gene encoding the

CC protease is located on human chromosome 11. The present sequence 

CC represents the human protease 

XX 

SQ Sequence 950 AA; 

Query Match 99.8%; Score 5156; DB 6; Length 950; 
Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950





RESULT 4 
ADQ88214 



ID ADQ88214 standard; protein; 950 AA. 

XX 

AC ADQ88214; 

XX 

DT 21-OCT-2004 (first entry) 

XX 

DE Human 65552 protein, a metalloprotease disintegrin 15 protein ADAMTS15. 

XX 

KW human; cardiovascular disorder; thrombotic disorder; 

KW differential expression; gene therapy; aberrant vascularisation; 

KW atherosclerosis; thrombosis; coronary artery disease; hyperlipidaemia; 

KW dyslipidaemia; high blood pressure; heart failure; cardiant; 

KW thrombolytic; anticoagulant; antilipaemic; hypotensive; cardiant; 

KW metalloprotease disintegrin 15; ADAMTS15. 

XX 



OS Homo sapiens. 
XX 

PN WO2004063340-A2.
XX 

PD 29-JUL-2004. 
XX 

PF 13-JAN-2004; 2004WO-US000393.
XX 

PR 13-JAN-2003; 2003US-0439683P. 

PR 05-FEB-2003; 2003US-0445216P. 

PR 18-FEB-2003; 2003US-0448036P. 

PR 12-MAR-2003; 2003US-0454189P. 

PR 25-MAR-2003; 2003US-0457541P. 

PR 29-APR-2003; 2003US-0466411P. 

PR 08-MAY-2003; 2003US-0469041P. 

PR 10-JUN-2003; 2003US-0477414P.

PR 13-JUN-2003; 2003US-0478560P.

PR 24-JUL-2003; 2003US-0489772P.

PR 28-JUL-2003; 2003US-0490660P.

PR 03-SEP-2003; 2003US-0499838P.

PR 22-SEP-2003; 2003US-0504786P.

PR 24-SEP-2003; 2003US-0505570P.

PR 17-OCT-2003; 2003US-0512418P.

PR 27-OCT-2003; 2003US-0514660P.
XX 

PA (MILL-) MILLENNIUM PHARM INC.
XX 

PI Stagliano NE, Healy A, Acton SL, Galvin KM, Donoghue MA; 

PI Rogrigue-Way A, Tomlinson JE; 

XX 

DR WPI; 2004-553729/53. 

DR N-PSDB; ADQ88213.
XX 

PT Identifying a compound for treating a cardiovascular or thrombotic 

PT disorder by combining a compound to be tested with e.g., a 9380, 9462, 

PT 87 01 or 2419 polypeptide or with a host cell expressing the polypeptide 

PT and detecting the binding.

XX 

PS Claim 1; SEQ ID NO 56; 512pp; English. 
XX 

CC This invention relates to a novel compound that is capable of treating a 

CC cardiovascular or thrombotic disorder. Specifically, it refers to the 

CC identification of nucleic acid molecules, and the encoded proteins 

CC thereof, which are differentially expressed in cardiovascular disease 

CC states relative to their normal expression in non-diseased tissue. The 

CC present invention describes test compounds (i.e. small molecules, 

CC peptides or antibodies) that can bind to and modulate the activity of 

CC these differentially expressed membrane-bound polypeptides, where binding

CC is detected by a competition binding assay, immunoassay or yeast two- 

CC hybrid assay. Accordingly, pharmaceutical compositions can be developed 

CC and used via gene therapy to treat aberrant vascularisation,

CC atherosclerosis, thrombosis, coronary artery disease, hyperlipidaemia, 

CC dyslipidaemia, high blood pressure or heart failure. As such, they 

CC exhibit cardiant, thrombolytic, anticoagulant, antilipaemic, hypotensive

CC and cardiant activities. This polypeptide sequence is a human protein 

CC that is differentially expressed in a patient with a cardiovascular 

CC disorder, given in an exemplification of the invention. 

XX 

SQ Sequence 950 AA; 

Query Match 99.8%; Score 5156; DB 8; Length 950; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950



RESULT 5 
ABR40092 

ID ABR40092 standard; protein; 950 AA. 
XX 

AC ABR40092; 
XX 

DT 30-JUN-2003 (first entry) 
XX 

DE Human ADAMTS-15. 
XX 

KW Human; cytostatic; antiarthritic; analgesic; antiinflammatory; leukaemia;

KW immunosuppressive; ADAMTS; disintegrin; metalloprotease; thrombospondin;

KW ADAMTS-15; ADAMTS-16; ADAMTS-17; ADAMTS-18; ADAMTS-19; wound healing;








KW extracellular matrix; cancer; metastasis; arthritis; pain; coagulation; 

KW immune disorder; inflammation; immune response; angiogenesis; rheumatism; 

KW embryo implantation; foetal development; nerve degeneration. 
XX 

OS Homo sapiens. 
XX 

PN WO2003027282-A1. 
XX 

PD 03-APR-2003. 
XX 

PF 24-SEP-2002; 2002WO-JP009771.
XX

PR 24-SEP-2001; 2001ES-00002165.

PR 24-SEP-2001; 2001ES-00002166.

PR 24-SEP-2001; 2001ES-00002167.

PR 25-SEP-2001; 2001ES-00002192.

PR 25-SEP-2001; 2001ES-00002193.
XX 

PA (DAII-) DAIICHI FINE CHEM CO LTD. 

PA (UYOV-) UNIV OVIEDO. 

XX 

PI Cal Miguel S, Obaya Gonzalez AJ, Llamazares Prada M; 

PI Garabaya Fernandez C, Lopez-Otin C;

XX 

DR WPI; 2003-313640/30. 

DR N-PSDB; ACC00342.
XX 

PT ADAMTS family zinc metalloproteases for treatment and investigation of 

PT metalloprotease related disorders including cancer and arthritis. 

XX 

PS Claim 3; Page 113-118; 169pp; Japanese. 
XX 

CC The present invention relates to novel human ADAMTS (A disintegrin and 

CC metalloprotease domain-thrombospondin domain) family proteins: ADAMTS-15, 

CC ADAMTS-16, ADAMTS-17, ADAMTS-18 and ADAMTS-19 (ACC00342-ACC00346 and 

CC ABR40092-ABR40096). The ADAMTS proteins and their coding sequences are

CC useful for the treatment, prevention, diagnosis and investigation of 

CC disorders associated with the extracellular matrix, such as cancer, 

CC cancer metastasis, leukaemia, arthritis, pain, immune disorders and 

CC inflammation; investigation of related processes such as immune response, 

CC angiogenesis, coagulation, wound healing, embryo implantation, foetal 

CC development, cancer proliferation and infiltration, metastasis, 

CC rheumatism and nerve degeneration 
XX 

SQ Sequence 950 AA; 

Query Match 99.7%; Score 5149; DB 6; Length 950; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 948; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPGDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950





RESULT 6 
AAU74751 



ID AAU74751 standard; protein; 952 AA. 
XX 

AC AAU74751; 
XX 

DT 09-APR-2002 (first entry) 
XX 

DE Human protease PRTS-11 protein sequence. 
XX 

KW Human; protease; PRTS; gastrointestinal; Crohn's disease; cancer; 

KW cardiovascular; atherosclerosis; autoimmune disorder; dermatitis; 

KW inflammatory disorder; acquired immunodeficiency syndrome; AIDS; 

KW cell proliferative disorder; developmental disorder; epilepsy; 

KW Duchenne muscular dystrophy; epithelial disorder; neurological disorder; 

KW reproductive disorder; endometriosis. 

XX 

OS Homo sapiens. 
XX 

PN WO200198468-A2. 
XX 

PD 27-DEC-2001. 
XX 

PF 13-JUN-2001; 2001WO-US019178.
XX

PR 16-JUN-2000; 2000US-0212336P.

PR 22-JUN-2000; 2000US-0213955P.

PR 29-JUN-2000; 2000US-0215396P.

PR 07-JUL-2000; 2000US-0216821P.

PR 14-JUL-2000; 2000US-0218946P.
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Yue H, Elliott VS, Gandhi AR, Lai P, Au-Young J, Tribouley CM;

PI Delegeane AM, Baughn MR, Nguyen DB, Lee EA, Hafalia A, Khan FA; 

PI Walia NK, Yao MG, Lu DAM, Patterson C, Tang YT, Walsh RT; 

PI Azimzai Y, Lu Y, Ramkumar J, Xu Y, Reddy R, Das D, Kearney L; 

PI Kallick DA; 

XX 

DR WPI; 2002-090437/12. 

DR N-PSDB; ABK12894.
XX 

PT Twenty one human proteases (referred to as PRTS-1 to PRTS-21), useful in 

PT the diagnosis, treatment and prevention of gastrointestinal (e.g. 

PT gastritis), cardiovascular (e.g. atherosclerosis) and cell proliferative

PT (e.g. cancer) disorders.

XX 

PS Claim 1; Page 144-146; 177pp; English. 
XX 

CC The present invention relates to twenty one new human proteases, referred 

CC to as PRTS-1 to PRTS-21. The PRTS polynucleotides and polypeptides of the 

CC invention are useful in the diagnosis, treatment and prevention of 

CC gastrointestinal e.g. gastritis, esophageal carcinoma and Crohn's 

CC disease, cardiovascular e.g. atherosclerosis, hypertension and myocardial 

CC infarction, autoimmune/inflammatory e.g. acquired immunodeficiency

CC syndrome (AIDS), allergies and osteoarthritis, cell proliferative e.g. 

CC cancer, developmental e.g. Duchenne and Becker muscular dystrophy, 

CC epithelial e.g. dermatitis, neurological e.g. epilepsy and Alzheimer's 

CC disease and reproductive e.g. infertility and endometriosis disorders. 

CC Numerous other examples of each disorder are given in the specification. 

CC The present protein sequence represents the human protease PRTS-11 

CC protein of the invention 

XX 

SQ Sequence 952 AA; 

Query Match 99.0%; Score 5111; DB 5; Length 952; 

Best Local Similarity 99.4%; Pred. No. 0; 

Matches 946; Conservative 0; Mismatches 4; Indels 2; Gaps 2; 

Qy          1 MLLLGILTLAFAGRTAGGFEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
              |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db          1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420



Qy        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy        541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy        601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPK-WDGTLCSPDSTSVCVQGKCIKAGC 659
              |||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
Db        601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVWDGTLCSPDSTSVCVQGKCIKAGC 660

Qy        660 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 719
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 720

Qy        720 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 779
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 780

Qy        780 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPR-GPSVLHNSVLSLSNQVEQPDDR 838
              |||||||||||||||||||||||||||||||||  || ||||||||||||||||||||||
Db        781 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPPHPRGGPSVLHNSVLSLSNQVEQPDDR 840

Qy        839 PPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWE 898
              ||||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||
Db        841 PPARWVAGSWGPCSASCGSGLQKRAVDWRGSAGQRTVPACDAAHRPVETQACGEPCPTWE 900

Qy        899 LSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
              ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 LSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 952





RESULT 7 
ABP70062 

ID ABP70062 standard; protein; 924 AA. 
XX 

AC ABP70062; 
XX 

DT 27-JAN-2003 (first entry) 
XX 

DE Human NOV5a.
XX 

KW Human; anti-HIV; cytostatic; antidiabetic; antiasthmatic; cachexia; AIDS; 

KW antiinflammatory; cardiant; haemostatic; neuroprotective; anorectic; 

KW nootropic; immunosuppressive; osteopathic; antiparkinsonian; cancer; 

KW antiinfertility; cerebroprotective; gene therapy; NOVX; NOV; fertility;

KW metabolic disorder; diabetes; obesity; infectious disease; anorexia; 

KW neurodegenerative disease; Alzheimer's disease; Parkinson's disease; 

KW immune disorder; haematopoietic disorder; cardiovascular disorder; 

KW bronchial asthma; dyslipidemia; metabolic disturbance; neurogenesis; 

KW metabolic syndrome X; wasting disorder; cell differentiation; 

KW cell proliferation; haematopoiesis; wound healing; angiogenesis.
XX 

OS Homo sapiens. 
XX 

PN WO200272771-A2.
XX
PD 19-SEP-2002.
XX
PF 08-MAR-2002; 2002WO-US007288.
XX
PR 08-MAR-2001; 2001US-0274101P.
PR 08-MAR-2001; 2001US-0274194P.
PR 08-MAR-2001; 2001US-0274281P.
PR 08-MAR-2001; 2001US-0274322P.
PR 09-MAR-2001; 2001US-0274849P.
PR 12-MAR-2001; 2001US-0275235P.
PR 13-MAR-2001; 2001US-0275578P.
PR 13-MAR-2001; 2001US-0275579P.
PR 13-MAR-2001; 2001US-0275601P.
PR 14-MAR-2001; 2001US-0276000P.
PR 16-MAR-2001; 2001US-0276776P.
PR 19-MAR-2001; 2001US-0276994P.
PR 20-MAR-2001; 2001US-0277239P.
PR 20-MAR-2001; 2001US-0277321P.
PR 20-MAR-2001; 2001US-0277327P.
PR 20-MAR-2001; 2001US-0277338P.
PR 21-MAR-2001; 2001US-0277791P.
PR 22-MAR-2001; 2001US-0277833P.
PR 23-MAR-2001; 2001US-0278152P.
PR 26-MAR-2001; 2001US-0278894P.
PR 27-MAR-2001; 2001US-0278999P.
PR 27-MAR-2001; 2001US-0279036P.
PR 28-MAR-2001; 2001US-0279344P.
PR 30-MAR-2001; 2001US-0279995P.
PR 30-MAR-2001; 2001US-0280233P.
PR 02-APR-2001; 2001US-0280802P.
PR 02-APR-2001; 2001US-0280822P.
PR 02-APR-2001; 2001US-0280900P.
PR 04-APR-2001; 2001US-0281194P.
PR 13-APR-2001; 2001US-0283675P.
PR 30-APR-2001; 2001US-0287424P.
PR 02-MAY-2001; 2001US-0288066P.
PR 03-MAY-2001; 2001US-0288342P.
PR 03-MAY-2001; 2001US-0288528P.
PR 15-MAY-2001; 2001US-0291190P.
PR 16-MAY-2001; 2001US-0291099P.
PR 16-MAY-2001; 2001US-0291240P.
PR 30-MAY-2001; 2001US-0294485P.
PR 31-MAY-2001; 2001US-0294889P.
PR 31-MAY-2001; 2001US-0294899P.
PR 18-JUN-2001; 2001US-0299027P.
PR 19-JUN-2001; 2001US-0299303P.
PR 19-JUN-2001; 2001US-0299310P.
PR 10-JUL-2001; 2001US-0304354P.
PR 31-JUL-2001; 2001US-0309198P.
PR 16-AUG-2001; 2001US-0312903P.
PR 10-SEP-2001; 2001US-0318462P.
PR 12-SEP-2001; 2001US-0318770P.
PR 27-SEP-2001; 2001US-0325430P.
PR 27-SEP-2001; 2001US-0325681P.
PR 18-OCT-2001; 2001US-0330380P.
PR 31-OCT-2001; 2001US-0335301P.
PR 14-NOV-2001; 2001US-0332172P.
PR 14-NOV-2001; 2001US-0332271P.
PR 14-NOV-2001; 2001US-0332272P.
PR 14-NOV-2001; 2001US-0333184P.
PR 14-NOV-2001; 2001US-0333272P.
PR 21-NOV-2001; 2001US-0332094P.
PR 03-DEC-2001; 2001US-0337426P.
PR 03-DEC-2001; 2001US-0338092P.
PR 04-DEC-2001; 2001US-0337185P.
PR 03-JAN-2002; 2002US-0345705P.
PR 08-MAR-2002; 2002US-00093463.
XX
PA (CURA-) CURAGEN CORP.
XX
PI Rastelli L, Mezes PD, Smithson G, Guo X, Gerlach V, Gasman SJ;
PI Boldog FL, Li L, Zerhusen BD, Tchernev VT, Gangolli EA, Vernet CAM;
PI Pena CEA, Burgess CE, Liu X, Spytek KA, Gorman L, Spaderna SK;
PI Voss EZ, Malyankar UM, Anderson DW, Patturajan M, Miller CE;
PI Taupier RJ, Padigaru M, Shenoy SG, Kekuda R, Gusev VY, Pochart PF;
PI Zhong M;
XX
DR WPI; 2002-732824/79.
DR N-PSDB; ABV99340.
XX
PT New NOVX polypeptides and polynucleotides, useful for preventing,
PT diagnosing or treating NOVX-associated disorders e.g. diabetes, cancer,
PT Alzheimer's disease, dyslipidemias, obesity, immune or hematopoietic
PT disorders, and asthma.
XX 

PS Claim 1; Page 103; 619pp; English. 
XX 

CC The present invention relates to new isolated proteins (NOVX) and their 

CC coding sequences (ABV99327-ABV99595 and ABP70049-ABP70149), where X is

CC any number from 1 to 48. The NOVX proteins and coding sequences are

CC useful in the manufacture of a medicament for treating a syndrome

CC associated with a human disease, preferably a NOVX-associated disorder.

CC The NOVX coding sequences and proteins are useful for treating,

CC preventing or diagnosing diseases such as metabolic disorders, diabetes, 

CC obesity, infectious disease, anorexia, cancer-associated cachexia, 

CC cancer, neurodegenerative diseases, Alzheimer's disease, Parkinson's 

CC disease, immune disorders, haematopoietic disorders, cardiovascular 

CC disorders, fertility, bronchial asthma, AIDS, dyslipidemia, metabolic 

CC disturbances associated with obesity, metabolic syndrome X or wasting 

CC disorders associated with chronic diseases or various cancers. The NOVX 

CC coding sequences and proteins may also be used as targets for the 

CC identification of small molecules that modulate or inhibit e.g. 

CC neurogenesis, cell differentiation, cell proliferation, haematopoiesis, 

CC wound healing and angiogenesis, in gene therapy, in generation of 

CC antibodies that bind immunospecifically to NOVX substances for use in

CC therapeutic or diagnostic methods 

XX 

SQ Sequence 924 AA; 

Query Match 95.2%; Score 4915; DB 5; Length 924; 

Best Local Similarity 95.8%; Pred. No. 0; 

Matches 913; Conservative 2; Mismatches 6; Indels 32; Gaps 3;

Qy          1 MLLLGILTLAFAGRTAGGFEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
              |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db          1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKH---RVDGSWAKWDPYGPCSRTCGGGVQ 537
              |||||||||||||||||||||||||||||||||    |||||||||||||||||||||||
Db        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHSSSQVDGSWAKWDPYGPCSRTCGGGVQ 540

Qy        538 LARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRL 597
              |||||||||||||||||||||||||||||||||||  |||||||||||||||||||||||
Db        541 LARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPS--SGKSFREEQCEAFNGYNHSTNRL 598

Qy        598 TLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKA 657
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        599 TLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKA 658

Qy        658 GCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYK 717
              |||||||||||||||||||||||||||||||     |||||||||||||||||||||||
Db        659 GCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLLSPARHGYNFWAIPAGASSIDIRQRGYK 718

Qy        718 GLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILE 777
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        719 GLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILE 778

Qy        778 PLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDD 837
              ||||||||||||||||||||||||||||||||||
Db        779 PLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSSH-------------------------- 812

Qy        838 RPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTW 897
               ||||||||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db        813 -PPARWVAGSWGPCSASCGSGLQKRAVDWRGSAGQRTVPACDAAHRPVETQACGEPCPTW 871

Qy        898 ELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
              |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        872 ELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 924





RESULT 8 
AAU72899 

ID AAU72899 standard; protein; 928 AA. 
XX 

AC AAU72899; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Human metalloprotease partial protein sequence #11. 
XX 

KW Human; protease; PCR primer; cytostatic; immunomodulator; cardiant;

KW vasotropic; antimigraine; analgesic; endocrine; nootropic; tranquiliser;

KW hypertensive; hypotensive; neuroleptic; neuroprotective; anabolic; 

KW anorectic; antiinflammatory; aspartyl protease; cysteine protease; 

KW metalloprotease; serine protease; cancer; haematopoietic; breast; colon; 

KW lung; prostrate; cervical; brain; ovarian; bladder; kidney; pain; 

KW immune-related disease; cardiovascular disease; neuronal disease; 

KW migraine; sexual dysfunction; mood disorder; attention disorder; 

KW cognition disorder; hypotension; hypertension; psychotic disorder; 

KW dyskinesia; metabolic disorder; inflammatory disorder. 

XX 

OS Homo sapiens.
XX

PN WO200183782-A2.

PD 08-NOV-2001. 
XX 

PF 04-MAY-2001; 2001WO-US014431 . 
XX 

PR 04-MAY-2000; 2000US-0201879P . 
XX 

PA (SUGE-) SUGEN INC. 
XX 

PI Plowman GD, Whyte D, Sudarsanam S, Manning G, Caenepeel S; 

PI Payne V; 

XX 

DR WPI; 2002-041502/05. 

DR N-PSDB; AAS97182. 
XX 

PT Novel protease polypeptide useful for screening for substances that may

PT be used to treat, e.g., cancers, immune-related diseases, cardiovascular 

PT disease, migraine, pain, psychotic and inflammatory disorders. 
XX 

PS Claim 28; Fig 2G; 232pp; English. 



CC The invention relates to an isolated, enriched, or purified protease

CC polypeptide (I) and polynucleotide (II) encoding (I). (I) may be used to 

CC screen for substances (S) that may modulate its activity. Administering S 

CC (which modulates protease activity in vitro) may be used to treat a 

CC disease or disorder selected from cancers (e.g., of tissues, of blood or 

CC haematopoietic origin, of the breast, colon, lung, prostrate, cervical, 

CC brain, ovarian, bladder or kidney), immune-related diseases and

CC disorders, cardiovascular disease, brain or neuronal-associated diseases

CC (e.g., central or peripheral nervous system diseases, migraine, pain, 

CC sexual dysfunction, mood disorders, attention disorders, cognition 

CC disorders, hypotension, hypertension, psychotic disorders, neurological 

CC disorders and dyskinesias), metabolic disorders and inflammatory 

CC disorders. (I) may also be useful as a diagnostic tool for a disease or 

CC disorder such as those above. AAU72876-AAU72910 represent human protease 

CC amino acid sequences of the invention 

XX 

SQ Sequence 928 AA; 

Query Match 94.0%; Score 4856.5; DB 5; Length 928; 

Best Local Similarity 94.7%; Pred. No. 0; 

Matches 901; Conservative 0; Mismatches 1; Indels 49; Gaps 2; 

Qy          1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
              |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db         26 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 85

Qy         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         86 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 145

Qy        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        146 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 205

Qy        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        206 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 265

Qy        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        266 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 325

Qy        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        326 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 385

Qy        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        386 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 445

Qy        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        446 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 505

Qy        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        506 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 565

Qy        541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        566 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 625

Qy        601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPK-WDGTLCSPDSTSVCVQGKCIKAGC 659
              |||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
Db        626 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVWDGTLCSPDSTSVCVQGKCIKAGC 685

Qy        660 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 719
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        686 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 745

Qy        720 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 779
              |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        746 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRP----- 800

Qy        780 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRP 839
                                                         |||||||||||||||||
Db        801 -------------------------------------------NSVLSLSNQVEQPDDRP 817

Qy        840 PARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWEL 899
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        818 PARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWEL 877

Qy        900 SAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
              |||||||||||||||||||||||||||||||||||||||||||||||||||
Db        878 SAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 928

RESULT 9 
ABU08383 

ID ABU08383 standard; protein; 823 AA. 
XX 

AC ABU08383; 
XX 

DT 05-JUN-2003 (first entry) 
XX 

DE Human matrix metalloproteinase (MMP) 65552. 
XX 

KW Human; matrix metalloproteinase; MMP 65552; carcinoma; sarcoma; 

KW cellular proliferation disorder; differentiation disorder; leukaemia; 

KW lung cancer; breast cancer; hormonal disorder; diabetes mellitus; 

KW thyroid disorder; fertility disorder; hepatic disorder; schizophrenia; 

KW Wilson's disease; Gaucher's disease; neurological disorder; ischaemia;

KW Alzheimer's disease; Parkinson's disease; Huntington's disease; asthma; 

KW inflammatory disorder; Crohn's disease; immune disorder; hypertension; 

KW autoimmune disease; multiple sclerosis; psoriasis; allergy; arthritis; 

KW cardiovascular disorder; atherosclerosis; heart failure; bone disorder; 

KW motility disorder; developmental disorder; skeletal disorder; obesity; 

KW osteoporosis; osteomalacia; pain disorder; metabolic disorder; cachexia; 

KW anorexia nervosa; blood disorder; clotting disorder; thrombocytopaenia; 

KW cytostatic; antidiabetic; antithyroid; antiinfertility; hepatotropic;

KW nootropic; neuroprotective; neuroleptic; anticonvulsant; vasotropic; 

KW antiinflammatory; immunosuppressive; antipsoriatic; antiasthmatic; 

KW antiallergic; antiarthritic; antiarteriosclerotic; hypotensive; anaemia; 

KW cardiant; osteopathic; analgesic; anorectic; antidepressant; haemostatic; 

KW immunomodulator; enzyme.

XX 

OS Homo sapiens. 
XX 

PN US2002197703-A1. 
XX 

PD 26-DEC-2002. 
XX 

PF 05-JUN-2002; 2002US-00163316.
XX 

PR 13-JUN-2001; 2001US-0297863P. 
XX 

PA (MILL-) MILLENNIUM PHARM INC. 
XX 

PI Kapeller-Libermann R; 
XX 

DR WPI; 2003-352751/33. 

DR N-PSDB; ABX94139. 
XX 

PT New human matrix metalloproteinase nucleic acid and polypeptide 

PT molecules, designated 65552, useful for diagnosing, preventing or 

PT treating cancers, diabetes, multiple sclerosis, asthma, obesity, 

PT hypertension, pain or psoriasis. 
XX 

PS Claim 17; Page 42-44; 55pp; English. 
XX 

CC The present invention relates to the isolation of a novel human matrix 

CC metalloproteinase (MMP), and the polynucleotide sequence encoding it. The 



CC MMP 65552 polynucleotide and polypeptide sequences are useful for 

CC diagnosing, preventing, alleviating or treating conditions associated 

CC with aberrant expression or activity of 65552 nucleic acids or 

CC polypeptides, e.g. cellular proliferation and/or differentiation 

CC disorders (e.g. carcinoma, sarcoma, leukaemia, lung or breast cancer), 

CC hormonal disorders (e.g. diabetes mellitus, thyroid disorders, fertility 

CC disorders), hepatic disorders (e.g. Wilson's disease, Gaucher's disease), 

CC neurological disorders (e.g. schizophrenia, ischaemia, Alzheimer's 

CC disease, Parkinson's disease, Huntington's disease), inflammatory 

CC disorders (e.g. Crohn's disease), immune disorders (e.g. autoimmune 

CC disease, multiple sclerosis, psoriasis, asthma, allergy, arthritis), 

CC cardiovascular disorders (e.g. atherosclerosis, hypertension, heart 

CC failure), motility disorders, developmental disorders, skeletal or bone 

CC disorders (e.g. osteoporosis, osteomalacia), pain disorders, metabolic 

CC disorders (e.g. obesity, anorexia nervosa, cachexia), and blood or

CC clotting disorders (e.g. anaemia or thrombocytopaenia). The present

CC sequence represents human MMP 65552

XX 

SQ Sequence 823 AA; 



Query Match 82.2%; Score 4242.5; DB 6; Length 823; 

Best Local Similarity 96.9%; Pred. No. 0;

Matches 791; Conservative 0; Mismatches 2; Indels 23; Gaps 1; 

Qy          1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
              |||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db          1 MLLLGILTLAFAGRTAGGSEPEREWVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
              ||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||
Db         61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGGVNAEPDSFAAVSLC 120

Qy        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHR-----------------------VDG 517
              ||||||||||||||||||||||||||||||||||                       |||
Db        481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRPPTDIISPKQLLLRLPNGLHTTQVDG 540

Qy        518 SWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGK 577
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 SWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGK 600

Qy        578 SFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWD 637
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 SFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWD 660

Qy        638 GTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYN 697
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 GTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYN 720

Qy        698 FWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLL 757
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 FWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLL 780

Qy        758 RYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPR 793
              ||||||||||||||||||||||||||||||||||||
Db        781 RYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPR 816

RESULT 10 
ABP70063 

ID ABP70063 standard; protein; 755 AA. 
XX 

AC ABP70063; 
XX 

DT 27-JAN-2003 (first entry) 
XX 

DE Human NOV5b.
XX 

KW Human; anti-HIV; cytostatic; antidiabetic; antiasthmatic; cachexia; AIDS; 

KW antiinflammatory; cardiant; haemostatic; neuroprotective; anorectic; 

KW nootropic; immunosuppressive; osteopathic; antiparkinsonian; cancer; 

KW antiinfertility; cerebroprotective; gene therapy; NOVX; NOV; fertility;

KW metabolic disorder; diabetes; obesity; infectious disease; anorexia; 

KW neurodegenerative disease; Alzheimer's disease; Parkinson's disease; 

KW immune disorder; haematopoietic disorder; cardiovascular disorder; 

KW bronchial asthma; dyslipidemia; metabolic disturbance; neurogenesis; 

KW metabolic syndrome X; wasting disorder; cell differentiation; 

KW cell proliferation; haematopoiesis; wound healing; angiogenesis.
XX 

OS Homo sapiens. 
XX 

PN WO200272771-A2. 
XX 

PD 19-SEP-2002. 
XX 

PF 08-MAR-2002; 2002WO-US007288.
XX
PR 08-MAR-2001; 2001US-0274101P.
PR 08-MAR-2001; 2001US-0274194P.
PR 08-MAR-2001; 2001US-0274281P.
PR 08-MAR-2001; 2001US-0274322P.
PR 09-MAR-2001; 2001US-0274849P.
PR 12-MAR-2001; 2001US-0275235P.
PR 13-MAR-2001; 2001US-0275578P.
PR 13-MAR-2001; 2001US-0275579P.
PR 13-MAR-2001; 2001US-0275601P.
PR 14-MAR-2001; 2001US-0276000P.
PR 16-MAR-2001; 2001US-0276776P.
PR 19-MAR-2001; 2001US-0276994P.
PR 20-MAR-2001; 2001US-0277239P.
PR 20-MAR-2001; 2001US-0277321P.
PR 20-MAR-2001; 2001US-0277327P.
PR 20-MAR-2001; 2001US-0277338P.
PR 21-MAR-2001; 2001US-0277791P.
PR 22-MAR-2001; 2001US-0277833P.
PR 23-MAR-2001; 2001US-0278152P.
PR 26-MAR-2001; 2001US-0278894P.
PR 27-MAR-2001; 2001US-0278999P.
PR 27-MAR-2001; 2001US-0279036P.
PR 28-MAR-2001; 2001US-0279344P.
PR 30-MAR-2001; 2001US-0279995P.
PR 30-MAR-2001; 2001US-0280233P.
PR 02-APR-2001; 2001US-0280802P.
PR 02-APR-2001; 2001US-0280822P.
PR 02-APR-2001; 2001US-0280900P.
PR 04-APR-2001; 2001US-0281194P.
PR 13-APR-2001; 2001US-0283675P.
PR 30-APR-2001; 2001US-0287424P.

PR 02-MAY-2001; 2001US-0288066P . 
PR 2001US-0288342P.

PR 2001US-0288528P.

PR 2001US-0291190P.

PR 2001US-0291099P.

PR 2001US-0291240P.

PR 2001US-0294485P.

PR 2001US-0294889P.

PR 2001US-0294899P.

PR 2001US-0299027P.

PR 2001US-0299303P.

PR 2001US-0299310P.

PR 2001US-0304354P.

PR 2001US-0309198P.

PR 2001US-0312903P.

PR 2001US-0318462P.

PR 2001US-0318770P.

PR 2001US-0325430P.

PR 2001US-0325681P.

PR 2001US-0330380P.

PR 2001US-0335301P.

PR 2001US-0332172P.

PR 2001US-0332271P.

PR 2001US-0332272P.

PR 2001US-0333184P.

PR 2001US-0333272P.

PR 2001US-0332094P.

PR 2001US-0337426P.

PR 2001US-0338092P.

PR 2001US-0337185P.

PR 03-JAN-2002; 2002US-0345705P.

PR 08-MAR-2002; 2002US-00093463.
XX

PA (CURA-) CURAGEN CORP.
XX

PI Rastelli L, Mezes PD, Smithson G, Guo X, Gerlach V, Gasman SJ;

PI Boldog FL, Li L, Zerhusen BD, Tchernev VT, Gangolli EA, Vernet CAM;

PI Pena CEA, Burgess CE, Liu X, Spytek KA, Gorman L, Spaderna SK;

PI Voss EZ, Malyankar UM, Anderson DW, Patturajan M, Miller CE;

PI Taupier RJ, Padigaru M, Shenoy SG, Kekuda R, Gusev VY, Pochart PF;

PI Zhong M;
XX

DR WPI; 2002-732824/79.

DR N-PSDB; ABV99341.
XX

PT New NOVX polypeptides and polynucleotides, useful for preventing,

PT diagnosing or treating NOVX-associated disorders e.g. diabetes, cancer,

PT Alzheimer's disease, dyslipidemias, obesity, immune or hematopoietic

PT disorders, and asthma.
XX

PS Claim 1; Page 104; 619pp; English.
XX

CC The present invention relates to new isolated proteins (NOVX) and their

CC coding sequences (ABV99327-ABV99595 and ABP70049-ABP70149), where X is

CC any number from 1 to 48. The NOVX proteins and coding sequences are

CC useful in the manufacture of a medicament for treating a syndrome

CC associated with a human disease, preferably a NOVX-associated disorder.

CC The NOVX coding sequences and proteins are useful for treating,

CC preventing or diagnosing diseases such as metabolic disorders, diabetes,

CC obesity, infectious disease, anorexia, cancer-associated cachexia,

CC cancer, neurodegenerative diseases, Alzheimer's disease, Parkinson's

CC disease, immune disorders, haematopoietic disorders, cardiovascular

CC disorders, fertility, bronchial asthma, AIDS, dyslipidemia, metabolic

CC disturbances associated with obesity, metabolic syndrome X or wasting

CC disorders associated with chronic diseases or various cancers. The NOVX

CC coding sequences and proteins may also be used as targets for the

CC identification of small molecules that modulate or inhibit e.g.

CC neurogenesis, cell differentiation, cell proliferation, haematopoiesis,

CC wound healing and angiogenesis, in gene therapy, in generation of

CC antibodies that bind immunospecifically to NOVX substances for use in

CC therapeutic or diagnostic methods

XX



SQ Sequence 755 AA; 

Query Match 76.6%; Score 3957.5; DB 5; Length 755; 

Best Local Similarity 79.4%; Pred. No. 1.5e-298; 

Matches 754; Conservative 0; Mismatches 1; Indels 195; Gaps 1; 
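The summary figures above can be cross-checked with a little arithmetic. The sketch below assumes (this is inferred from the numbers in this report, not taken from GenCore documentation) that "Query Match" is the score divided by the perfect score of 5164, and that "Best Local Similarity" is the number of matches divided by all alignment columns (matches, conservative substitutions, mismatches and indels). The function names are hypothetical.

```python
def query_match_pct(score, perfect_score):
    """Score as a percentage of the query's perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures copied from the summary lines of this result.
qm = query_match_pct(3957.5, 5164)
bls = best_local_similarity_pct(754, 0, 1, 195)
print(f"Query Match {qm:.1f}%; Best Local Similarity {bls:.1f}%")
```

Under these assumptions the computed values round to the 76.6% and 79.4% printed in the report, which suggests the 195 indel positions are counted as alignment columns.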

Qy 1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
     |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db 1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301
       |||||||||||||||||||
Db 301 319

Qy 361
Db 320 319

Qy 421
Db 320 319

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
                                         ||||||||||||||||||||||||||
Db 320                                   VDGSWAKWDPYGPCSRTCGGGVQLAR 345

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 346 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 405

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 406 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 465

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 466 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 525

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 526 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 585

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 586 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 645

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 646 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 705

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 706 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 755



RESULT 11 
AAB21257 

ID AAB21257 standard; protein; 505 AA. 
XX 

AC AAB21257; 
XX 

DT 23-FEB-2001 (first entry) 
XX 

DE Rat metalloproteinase ADAMTS-5. 
XX 

KW Rat; ADAMTS-5; metalloproteinase; ADAM; 

KW a disintegrin and metalloproteinase domain; thrombospondin domain; 

KW vaccine; nootropic; neuroprotective; antiparkinsonian; cerebroprotective; 

KW cytostatic; antiarthritic; immunosuppressive; Alzheimer's disease; 

KW Parkinson's disease; stroke; cancer; arthritis; autoimmune disease; 

KW brain tumour; brain injury. 

XX 

OS Rattus norvegicus.
XX 

PN WO200053774-A2. 
XX 

PD 14-SEP-2000.

PR ^^^-MAR-1999; 99US-00264585.

PA (NEUR-) NEUROCRINE BIOSCIENCES INC.
XX 

PI Kelner GS, Clark M, Maki RA; 
XX 

DR WPI; 2000-594326/56. 

DR N-PSDB; AAA95827 . 
XX 

PT Polynucleotide encoding novel members of a disintegrin, metalloproteinase 

PT and thrombospondin domain protein family used to prevent and treat 

PT Alzheimer's disease, cancer and autoimmune diseases. 
XX 

PS Claim 12; Fig 14; 129pp; English. 
XX 

CC The present sequence is rat metalloproteinase ADAMTS-5. The ADAMTS family 

CC of proteins is closely related to the ADAM (A Disintegrin and 

CC Metalloproteinase Domain) family. Members of the ADAMTS family contain a 

CC thrombospondin domain in addition to the disintegrin and 

CC metalloproteinase domains found in the ADAMs. ADAMTS polypeptides are

CC useful for the manufacture of medicaments for treating conditions

CC associated with neuroinflammation and/or neurodegeneration, such as

CC Alzheimer's disease, Parkinson's disease and stroke. They are also useful 

CC for treating conditions associated with cell proliferation, cell 

CC migration, inflammation and/or angiogenesis, such as cancer, arthritis 

CC and autoimmune diseases. They can be used to treat patients afflicted 

CC with an invasive tumour, a brain tumour or brain injury 

XX 
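The entry above follows the line-code layout used throughout this report: each line starts with a two-letter code (ID, AC, DT, DE, KW, OS, PN and so on) and XX lines act as separators. As a minimal sketch, assuming that layout holds for every line of interest, the fields of one entry could be collected as follows. The function name `parse_record` and the shortened sample text are illustrative only.

```python
from collections import defaultdict


def parse_record(text):
    """Group the lines of one entry by their two-letter line code."""
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line == "XX":
            continue  # blank lines and XX separators carry no data
        code, _, value = line.partition(" ")
        if len(code) == 2 and code.isalpha():
            fields[code.upper()].append(value.strip())
    # Join continuation lines that share a code into one string.
    return {code: " ".join(values) for code, values in fields.items()}


# Shortened sample copied from the entry above.
sample = """ID AAB21257 standard; protein; 505 AA.
XX
AC AAB21257;
XX
DE Rat metalloproteinase ADAMTS-5.
XX
KW Rat; ADAMTS-5; metalloproteinase; ADAM;
KW a disintegrin and metalloproteinase domain; thrombospondin domain;
"""
record = parse_record(sample)
print(record["DE"])
```

Upper-casing the code lets OCR variants such as a lowercase "cc" fall into the same field as "CC".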

SQ Sequence 505 AA; 

Query Match 48.4%; Score 2497.5; DB 3; Length 505;
Best Local Similarity 93.5%; Pred. No. 3.4e-185; 

Matches 462; Conservative 8; Mismatches 21; Indels 3; Gaps 2; 

Qy 89 QGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNASAPAAQ 148 





Db 




Db 



Qy 




Qy 209 RAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVWK 268 

IIIIIIIIIMIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 133 RAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIWVK 192 



Qy 269 VLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCD 328

Mil llhlllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 193 VLLLGDRDTGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCD 252

Qy 329 TLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANH 388 

llllilllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 253 TLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRANH 312 

Qy 389 MMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQC 448 

llllllllllllllllllllllllllllllllllllllllllhlllllll Ihlllll 

Db 313 MMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPITLPEDLPGTSYSLSQQC 372 

Qy 449 ELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVERH 508 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII llllllllll 
Db 373 ELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKFCLKGACVERH 432 

Qy 509 NLNKHRVDGSWAKWDPYGPCSRTCGGGVQLARR--QCTNPTPANGGKYCEGVRVKYRSCN 566 

I 11:1111 lllhllllllllllll mil II II llllllllllllllll 
Db 433 NPNKYRVDGPWAKWEPYGPCSRTCGGGAQLARRQVQATLPLP-TGGKYCEGVRVKYRSCN 491 

Qy 567 LEPCPSSASGKSFR 580 

lllillllllllll 
Db 492 LEPCPSSASGKSFR 505 



RESULT 12 
ABU08387 

ID ABU08387 standard; protein; 968 AA. 
XX 

AC ABU08387; 
XX 

DT 05-JUN-2003 (first entry) 
XX 

DE Murine matrix metalloproteinase. 
XX 

KW Murine; matrix metalloproteinase; MMP; carcinoma; sarcoma; mouse; 

KW cellular proliferation disorder; differentiation disorder; leukaemia; 

KW lung cancer; breast cancer; hormonal disorder; diabetes mellitus; 

KW thyroid disorder; fertility disorder; hepatic disorder; schizophrenia; 

KW Wilson's disease; Gaucher's disease; neurological disorder; ischaemia;

KW Alzheimer's disease; Parkinson's disease; Huntington's disease; asthma; 

KW inflammatory disorder; Crohn's disease; immune disorder; hypertension; 

KW autoimmune disease; multiple sclerosis; psoriasis; allergy; arthritis; 

KW cardiovascular disorder; atherosclerosis; heart failure; bone disorder; 

KW motility disorder; developmental disorder; skeletal disorder; obesity; 

KW osteoporosis; osteomalacia; pain disorder; metabolic disorder; cachexia; 

KW anorexia nervosa; blood disorder; clotting disorder; thrombocytopaenia; 

KW cytostatic; antidiabetic; antithyroid; antiinfertility; hepatotropic;

KW nootropic; neuroprotective; neuroleptic; anticonvulsant; vasotropic;

KW antiinflammatory; immunosuppressive; antipsoriatic; antiasthmatic;

KW antiallergic; antiarthritic; antiarteriosclerotic; hypotensive; anaemia;

KW cardiant; osteopathic; analgesic; anorectic; antidepressant; haemostatic;

KW immunomodulator; enzyme.

XX 

OS Mus musculus. 
XX 

PN US2002197703-A1. 
XX 

PD 26-DEC-2002. 
XX 

PF 05-JUN-2002; 2002US-00163316 . 
XX 

PR 13-JUN-2001; 2001US-0297863P. 
XX 

PA (MILL-) MILLENNIUM PHARM INC. 
XX 

PI Kapeller-Libermann R; 
XX 

DR WPI; 2003-352751/33. 
XX 

PT New human matrix metalloproteinase nucleic acid and polypeptide 



PT molecules, designated 65552, useful for diagnosing, preventing or 

PT treating cancers, diabetes, multiple sclerosis, asthma, obesity, 

PT hypertension, pain or psoriasis. 
XX 

PS Disclosure; Page 48-50; 55pp; English. 
XX 

CC The present invention relates to the isolation of a novel human matrix 

CC metalloproteinase (MMP), and the polynucleotide sequence encoding it. The

CC MMP 65552 polynucleotide and polypeptide sequences are useful for

CC diagnosing, preventing, alleviating or treating conditions associated 

CC with aberrant expression or activity of 65552 nucleic acids or 

CC polypeptides, e.g. cellular proliferation and/or differentiation 

CC disorders (e.g. carcinoma, sarcoma, leukaemia, lung or breast cancer), 

CC hormonal disorders (e.g. diabetes mellitus, thyroid disorders, fertility 

CC disorders), hepatic disorders (e.g. Wilson's disease, Gaucher's disease),

CC neurological disorders (e.g. schizophrenia, ischaemia, Alzheimer's 

CC disease, Parkinson's disease, Huntington's disease), inflammatory 

CC disorders (e.g. Crohn's disease), immune disorders (e.g. autoimmune 

CC disease, multiple sclerosis, psoriasis, asthma, allergy, arthritis), 

CC cardiovascular disorders (e.g. atherosclerosis, hypertension, heart 

CC failure), motility disorders, developmental disorders, skeletal or bone 

CC disorders (e.g. osteoporosis, osteomalacia), pain disorders, metabolic 

CC disorders (e.g. obesity, anorexia nervosa, cachexia), and blood or 

CC clotting disorders (e.g. anaemia or thrombocytopaenia). The present

CC sequence represents a murine MMP 

XX 

SQ Sequence 968 AA; 



Query Match 48.1%; Score 2482.5; DB 6; Length 968; 

Best Local Similarity 48.0%; Pred. No. 1.2e-183;

Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 



Qy 1 MLLLGILTLAFAGRTAGG--FEPEREVVVPIRLDPDINGRRYYWRGP-EDSGDQGLIFQI 57
:||| :|: III 1 : hhl h 1 1 II 1
Db 37 LLLLASITMLLCARGAHGRPTEEDEELVLP-SLE RAPGHDSTTTRL--RL 83

Qy 58 TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNA 109
II : :|l II: Mil h : : 1 : M M HIM H
Db 84 DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 138

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161
:| 1 IIMII hllll :| |: 1 1 1 : III 1 MIMI
Db 139 DPGSAAALSLCEGVRGAFYLQGEEFFIQPAPGVATERLAPAVPEEESSARPQFHILRRR- 197

Qy 162 VPGGPSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206
1 ::|ll 1 1 Ih: 1
Db 198 RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 243

Qy 207 SGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262
1 1 lllll MlllhMlhM III: hllllll : III hllll 1 1
Db 244 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 303

Qy 263 NIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322
::||||:|:: : MMI llllllllll MM 1 IM IhMIMIIIIIIII
Db 304 SLVWKILVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 363

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382
|: IMIIIIIIMhIM IIIMIIllll Mllllllllllllllll: 1 1 : 1
Db 364 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 423

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442
:|:|: | :| : III III ::| 1 1 1 : II 1 : 1 1 : 1 : 1 II II Mil 1
Db 424 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 483

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501
::||: || || II II lllll : 1 : 1 1 1 h 1 1 II 1 1 1 1 1 1 II 1 1 M
Db 484 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 543

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558
1 II : II 1 Ml 1 Ml llllllllll Ml II 1 IMMIII
Db 544 GKCVNKTDM-KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 602


Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGY^FHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

Ihllllhl II : :||:||||llll I II llhlllhhill 

Db 603 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 661 



Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I llhll lllllll illllllllllhhlMII : llhlllllllll: 
Db 662 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 721 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:|||::|: | Ih :| lllll-l::: I =1 : -Ih: : I hii 

Db 722 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 781 

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

|:| :| :|:|| ||::|lll|: hi h lllhHI II 1: = ::- 

Db 782 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKFTY 841 

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

|: |:: : I II II II =1111 

Db 842 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 874 

Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

|:| III I II : I :| h I : III \ I I I I I I : I I : I = : : 

Db 875 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 931 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

hllll I I :h : h :|h \ W I I 
Db 932 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 967 



RESULT 13 
ADG72483 

ID ADG72483 standard; protein; 949 AA. 
XX 

AC ADG72483; 
XX 

DT 11-MAR-2004 (first entry)
XX 

DE Human aggrecanase (ADAMTS-1).
XX 

KW Human; aggrecanase; ADAMTS-1; rheumatoid arthritis; osteoarthritis; 

KW antiarthritic; osteopathic; enzyme.

XX 

OS Homo sapiens. 
XX 

PN US6649377-B1.
XX 

PD 18-NOV-2003. 
XX 

PF 09-MAY-2000; 2000US-00568559 . 
XX 

PR 10-MAY-1999; 99US-0133343P.
XX 

PA (SYNT) SYNTEX USA LLC.
XX 

PI Allard JD, Heller RA, Klonowski P, Vanwart HE; 
XX 

DR WPI; 2003-896162/82. 

DR N-PSDB; ADG72482. 
XX 

PT New nucleic acid present in other than its natural environment, useful 

PT for preparing a composition for diagnosing or treating diseases 

PT associated with aggrecanase activity, e.g. rheumatoid arthritis or 

PT osteoarthritis. 
XX 

PS Disclosure; SEQ ID NO 2; 26pp; English. 
XX 

CC The present invention relates to the isolation of human aggrecanase 

CC (ADAMTS-1), and the polynucleotide sequence encoding it. Also disclosed

CC is a method of producing aggrecanase in vitro. The polynucleotide

CC sequence encoding aggrecanase is useful for preparing a composition for 



CC diagnosing or treating diseases associated with aggrecanase activity e.g. 

CC rheumatoid arthritis or osteoarthritis. The present sequence represents 

CC human aggrecanase. 
XX 

SQ Sequence 949 AA; 

Query Match 48.0%; Score 2480.5; DB 7; Length 949; 

Best Local Similarity 48.4%; Pred. No. 1.7e-183; 

Matches 482; Conservative 153; Mismatches 250; Indels 111; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

:||| II : I : hill h III II 
Db 18 LLLLAAALLAVSDALGRPSEEDEELWP-ELE RAP GHGTTRLRLHAF 63 

Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSF 114 

: I I Ih Mil h :::| II HI lllll II :| I 

Db 64 DQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLP ETDLAHCFYSGTVNGDPSSA 118 

Qy 115 AAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRGVPGGPS 167 

Ihlll hllll I I Mil II I II I llhl 
Db 119 AALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRR NRQ 173 

Qy 168 GDPTSRCGV ASGWNPAILRALDPYKPRRAGFGESR 202 

II Ml : |: I I I h 

Db 174 GDVGGTCGWDDEPRPTGKAETEDEDEGTEGEDEGAQWS PQDPALQGVGQP- 224 

Qy 203 SRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

: I I Mill IIMhMlhM Mlh hllllM : MMhIIII I : 

Db 225 TGTGSIRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSV 284 

Qy 263 NIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||lhh: I IhM IIIIIIIIM Mh I Ih hMIIIIIIIIIII 
Db 285 SLVWKILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLC 344 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382

h MIMMIIIMMM MMMMIII MilMIIIMIIIMIh \ \ \ 

Db 345 GSQTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNG 404 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

: Mlh I M : Ml Ml Ml llhMhIhhl M II MM II 

Db 405 VNQDSHMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSY 464 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

:Mh M II II h HIM : I MMhIIIIMIIIMIII h 

Db 465 DANRQCQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCIN 524 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

III:: M III I hi IIIIIIIMI hi M I IIMIMI 

Db 525 GKCVNKTH-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 583 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

IhlllllM II : :||:|llllill I :: :: II hllhlHhhlll 

Db 584 RVRYRSCNLEDCPDN-NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKL 642 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

MM I MhM IIMIII IIMIMIIIIhhIIIII : Mhllllllllh 

Db 643 ICQAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 702 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

Mlh:| I Ih :: II lh:h::M :| = ::|hl : I hll 

Db 703 GSTCKKISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILN 762 

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

I : M :hh: II :|IMh Ml : = : h lllh:|hll h:M:: 
Db 763 GDYTLSTLEQDIMYKGWLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTY 822 

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: h II h: \ II II II II I 

Db 823 FVKKK KES FNAIPTFS AWVIEEWGECSKSCELG 855 



Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915



|:| |:|l II II : I :| h I : III hi II ll|:|h|::: 
Db 856 WQRRLVECRDINGQ PASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKK 912 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

Mill: I I :h : h :|h =111 : I 
Db 913 RSLKCLSHDGGVLSHESCDPLKKPKHFIDFCTMAEC 948 



RESULT 14 
ADZ21014 

ID ADZ21014 standard; protein; 949 AA. 
XX 

AC ADZ21014; 
XX 

DT 14-JUL-2005 (first entry)
XX 

DE Human aggrecanase (ADAMTS-1), SEQ ID NO: 2 #1.
XX 

KW Protein degradation; musculoskeletal disease; rheumatoid arthritis; 

KW osteoarthritis; gout; psoriatic arthritis; spondylosis; injury; fibrosis; 

KW pulmonary disease; inflammation; antiarthritic; osteopathic; 

KW antirheumatic; antigout; antiinflammatory; respiratory-gen.; ADAMTS-1; 

KW aggrecanase; enzyme. 

XX 

OS Homo sapiens. 
XX 

PN US2005100916-A1.
XX 

PD 12-MAY-2005. 
XX 

PF 17-SEP-2003; 2003US-00667281 . 
XX 

PR 10-MAY-1999; 99US-0133343P.

PR 09-MAY-2000; 2000US-00568559 . 
XX 

PA (ALLA/) ALLARD J D. 

PA (HELL/) HELLER R A. 

PA (KLON/) KLONOWSKI P. 

PA (VWAR/) VAN WART H E. 
XX 

PI Allard JD, Heller RA, Klonowski P, Van Wart HE; 
XX 

DR WPI; 2005-345393/35. 

DR N-PSDB; ADZ21013. 
XX 

PT New isolated human aggrecanase useful for diagnosing or treating 

PT conditions associated with aggrecanase activity, e.g. rheumatoid 

PT arthritis, osteoarthritis, joint trauma or fibrosis. 
XX 

PS Claim 33; SEQ ID NO 2; 26pp; English. 
XX 

CC The invention relates to human aggrecanase (ADZ21014 and ADZ21015), also

CC known as ADAMTS-1 (a disintegrin and metalloproteinase with 

CC thrombospondin motifs 1), and its fragments. The invention also discloses 

CC nucleic acids encoding human aggrecanase (ADZ21013), methods of treating 

CC disease conditions associated with aggrecanase activity, and methods of 

CC modulating aggrecanase activity in a cell. In arthritic cartilage, the 

CC major cartilage proteoglycan aggrecan is cleaved by proteases to produce 

CC 55 kD and 60 kD fragments. Aggrecanase is responsible for generation of 

CC the 60 kD fragment, cleaving aggrecan between Glu373 and Ala374 in a 

CC region located between globular domains G1 and G2 to yield a fragment

CC with the N-terminal sequence shown in ADZ21016. Cleavage in this region 

CC leads to the loss of aggrecan and exposes type II collagen to 

CC collagenases, causing cartilage loss and the consequent development of

CC joint disease. Human aggrecanase and nucleic acids encoding it are useful 

CC in the diagnosis or treatment of conditions associated with aggrecanase 

CC activity such as rheumatoid arthritis, osteoarthritis, infectious 

CC arthritis, gouty arthritis, psoriatic arthritis, spondylosis, sports 

CC injury, joint trauma, pulmonary disease and fibrosis. They may also be 

CC used for research purposes and in drug screening procedures. The present 

CC sequence represents human aggrecanase. Note: The present sequence is 1 



CC residue shorter than the version of SEQ ID NO: 2 shown in Figure 1.

CC Although the present sequence is designated SEQ ID NO: 2 in the sequence 

CC listing and the disclosure, an aggrecanase amino acid sequence is 

CC referred to in Claim 1. However, the claim refers to the amino acid 

CC sequence as SEQ ID NO:1, which is the aggrecanase-encoding nucleic acid

CC in the sequence listing and disclosure. 

XX 

SQ Sequence 949 AA; 

Query Match 48.0%; Score 2480.5; DB 9; Length 949; 

Best Local Similarity 48.4%; Pred. No. 1.7e-183; 

Matches 482; Conservative 153; Mismatches 250; Indels 111; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

:||| II : I : hill h III II 
Db 18 LLLLAAALLAVSDALGRPSEEDEELWP-ELE RAP GHGTTRLRLHAF 63 

Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSF 114 

: I 1 II: UN |: :::| II =11 lllll II =1 I 

Db 64 DQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLP ETDLAHCFYSGTVNGDPSSA 118 

Qy 115 AAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRGVPGGPS 167 

Ihlll hllll I II lllll III I llhl 
Db 119 AALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRR NRQ 173 

Qy 168 GDPTSRCGV ASGWNPAILRALDPYKPRRAGFGESR 202 

M III : h I I I h 

Db 174 GDVGGTCGWDDEPRPTGKAETEDEDEGTEGEDEGAQWS PQDPALQGVGQP- 224 

Qy 203 SRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

: I I lllll llllh:|lhll :|lh hIMIil : llllhllll I : 
Db 225 TGTGSIRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSV 284 

Qy 263 NIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||lhh: I Ihll llllllllll llh I Ih h : I I I I I I I I I M I 
Db 285 SLVWKILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLC 344 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382 

h lllllllllllhlll lllllllllll :||llllllllllllllh I I : I 
Db 345 GSQTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNG 404 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

: :|lh I :| : III III :|l llhllhlhhl II II MM II 
Db 405 VNQDSHMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSY 464 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

::||: II II II h lllll = I : I II h I i I M II I I I I I II h 

Db 465 DANRQCQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCIN 524 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

III:: II III I hi llllllllll hi II I llllllll 

Db 525 GKCVNKTH-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 583 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

Ihlllllll II : :|hllllllil I II hllhlllhhill 

Db 584 RVRYRSCNLEDCPDN-NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKL 642 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

Ihl I llhll IIMIII lllllllllllhhillll : llhllllllllh 

Db 643 ICQAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 702 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:|||::| I |h =: II lh:h::|l =1 : ::lhl : I hll 

Db 703 GSTCKKISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILN 762

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

I : :| :hh: II Hlllh hi h lllh^lhll h-l- 

Db 763 GDYTLSTLEQDIMYKGWLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTY 822

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: h II h^ U II II II II I 



Db 823 FVKKK KES FNAIPTFS AWVIEEWGECSKSCELG 855



Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

hi I: II II II : I :| h I : III hi II llhl|:h:: 
Db 856 WQRRLVECRDINGQ PASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKK 912 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

llllh I I :h : h :|h :|ll : I 
Db 913 RSLKCLSHDGGVLSHESCDPLKKPKHFIDFCTMAEC 948 



RESULT 15 
AAY49501 

ID AAY49501 standard; protein; 950 AA. 
XX 

AC AAY49501; 
XX 

DT 10-JAN-2000 (first entry)
XX 

DE Human METH1 protein.
XX 

KW Human; METH1; METH2; anti-angiogenic; metalloprotease thrombospondin;

KW cancer; diagnosis; hyperproliferative disorder; autoimmune disease;

KW angiogenesis inhibitor; abnormal wound healing; inflammation; 

KW rheumatoid arthritis; psoriasis; endometrial bleeding disorder; 

KW diabetic retinopathy; macula degeneration; haemangioma; detection; 

KW arterial-venous malformation; immune deficiency. 

XX 

OS Homo sapiens. 
XX 

PN WO9937660-A1. 
XX 

PD 29-JUL-1999.
XX 

PF 22-JAN-1999; 99WO-US001313 . 
XX 

PR 23-JAN-1998; 98US-0072298P . 

PR 28-AUG-1998; 98US-0098539P . 
XX 

PA (IRUE/) IRUELA-ARISPE L.

PA (HAST/) HASTINGS G A. 

PA (RUBE/) RUBEN S M. 
XX 

PI Iruela-Arispe L, Hastings GA, Ruben SM; 
XX 

DR WPI; 1999-590684/50. 

DR N-PSDB; AAZ32000. 
XX 

PT New isolated metalloprotease thrombospondin polypeptides, useful for 

PT treating hyperproliferative disorders, cancers or autoimmune disorders.
XX 

PS Claim 10; Fig 1; 457pp; English. 
XX 

CC AAZ32000 and AAZ32001 encode, and AAY49501 and AAY49502 represent, human 

CC metalloprotease thrombospondin (METH) proteins METH1 and METH2

CC respectively. METH1 and METH2 have been found to be potent inhibitors of

CC angiogenesis both in vitro and in vivo. They can be used for treating 

CC cancer and other disorders related to angiogenesis including abnormal 

CC wound healing, inflammation, rheumatoid arthritis, psoriasis, endometrial

CC bleeding disorders, diabetic retinopathy, some forms of macula 

CC degeneration, haemangiomas, and arterial-venous malformations. They may

CC be useful in treating deficiencies or disorders of the immune system, by 

CC activating or inhibiting the proliferation, differentiation, or 

CC mobilisation (chemotaxis) of immune cells. The etiology of these immune 

CC deficiencies or disorders may be genetic, somatic, such as cancer or some 

CC autoimmune disorders, acquired (e.g. by chemotherapy or toxins), or 

CC infectious. They can also be used to treat inflammatory conditions, both 

CC chronic and acute conditions. The products can also be used for detection

CC and diagnosis. AAZ32002 to AAZ32080, and AAY49503 to AAY49511 represent

CC sequences given in the exemplification of the present invention 

XX 



SQ Sequence 950 AA; 



Query Match 48.0%; Score 2480.5; DB 2; Length 950; 

Best Local Similarity 48.9%; Pred. No. 1.7e-183; 

Matches 485; Conservative 154; Mismatches 251; Indels 101; Gaps 24; 



Qy 1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
:||| II : 1 : hill h II 1 W
Db 19 LLLLAAALLAVSDALGRPSEEDEELWP-ELE RAP GHGTTRLRLHAF 64

Qy 61 QEDFYLHLTPDAQFLAPAFSTEHLG VPLQGLTGGSSDLRRCFYSGDVNAEPDSF 114
1 1 ||: nil |: :::| II :|l Illll II :| 1
Db 65 DQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLP ETDLAHCFYSGTVNGDPSSA 119

Qy 115 AAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQ--RRGVPGG 165
Ihlll hllll 1 1 Mil 11 Mi 1 III: hi II
Db 120 AALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRRNRQGDVGG 179

Qy 166 PSG DPTSRC GVASG--WNPAILRALDPYKPRRAGFGESRSRRRS 207
1 II : III: 1 1 11: : 1
Db 180 TCGWDDEPRPTGKAETEDEDEGTEGEDEGPQWS PQDPALQGVGQP-TGTGS 230

Qy 208 GRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPINIVW 267
1 Mill llll|::|IMI :|lh hllllll : llllhllll 1 :::|ll
Db 231 IRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVW 290

Qy 268 KVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTC 327
|:|:M l|:|MmimiMI|: 1 11= 1 = : II 1 1 ill 1 III 1 1 : II
Db 291 KILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTC 350

Qy 328 DTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFGKLRAN 387
lllllllllhlll Mmilllll :|IMIIIIIIIIim|: M : 1 : :
Db 351 DTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNGVNQDS 410

Qy 388 HMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQ 447
III: 1 :| : III III HI IIMIhlhM II II Mil M ::|
Db 411 HMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSYDANRQ 470

Qy 448 CELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVE 506
|: II M II h Mill : 1 :|II|:|MIIIIIMIIII M 1 II
Db 471 CQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCINGKCVN 530

Qy 507 RHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYR 563
: : M Ml 1 hi IIMMIIM hi II 1 Mllllll Ihll
Db 531 KTD-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYR 589

Qy 564 SCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRAN 623
Illll II : :|hllMMII 1 :: M hllhlMhhIMihl
Db 590 SCNLEDCPDN-NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKLICQAK 648

Qy 624 GTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCK 683
1 llhll IIIMM MlllllllllhhIMM : MhllllMllh :|l
Db 649 GIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCK 708

Qy 684 KVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFW 743
h:| 1 Ih II ll-h::!! M : ::|hl : 1 hill : :
Db 709 KISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILNGDYTL 768

Qy 744 SAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSFYLPKE 803
1 :\:\:: II :|||lh hi h MlhMhll |:::h::: h
Db 769 STLEQDIMYKGWLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTYFVKKK 828

Qy 804 PREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSGLQKRA 863
II h: : 1 M II II II 1 hi
Db 829 KES FNAI PTES AWVIEEWGECSKSCELGWQRRL 861

Qy 864 VDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQRRSLKC 920
hll II MM M hi MM hi II llhlhh::|llll
Db 862 VECRDINGQ PASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKKRSLKC 918

Qy 921 VGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950
: I I :|: : |: :||: : I
Db 919 LSHDGGVLSHESCDPLKKPKHFIDFCTMAEC 949



Search completed: August 5, 2006, 00:08:26 
Job time : 204 secs



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model



Run on: August 5, 2006, 00:26:10 ; Search time 186 Seconds
(without alignments)
2365.882 Million cell updates/sec
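
The throughput figure can be cross-checked from other values in this header. A Smith-Waterman search fills one dynamic-programming cell per (query residue, database residue) pair, so the cell count is the query length times the number of residues searched. The sketch below is an illustrative calculation, not part of the GenCore output; the function name is hypothetical.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Cells filled per second, in millions, for a full query-vs-database scan."""
    cells = query_len * db_residues
    return cells / seconds / 1e6


# Query length 950, 463214858 residues searched, 186 seconds search time.
print(round(million_cell_updates_per_sec(950, 463214858, 186), 3))
```

With the values reported for this run the result agrees with the 2365.882 printed above.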



Title: US-10-763-210-1 
Perfect score: 5164 



Sequence: 1 MLLLGILTLAFAGRTAGGFE DQCNLHRKPQELDFCVLRPC 950



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2097797 seqs, 463214858 residues 

Total number of hits satisfying chosen parameters: 2097797

Minimum DB seq length: 0

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%

Maximum Match 100%
Listing first 45 summaries
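
The header names the search model ("sw", Smith-Waterman local alignment) together with a gap-open penalty of 10.0 and a gap-extension penalty of 0.5. The following is a minimal sketch of local alignment scoring with affine gaps, under stated assumptions: it uses a toy match/mismatch score instead of the BLOSUM62 table, it charges a gap of length k as gap_open + k * gap_ext (GenCore's exact convention is not stated in this report), and the function name and default scores are hypothetical.

```python
def smith_waterman_score(query, target, match=5.0, mismatch=-4.0,
                         gap_open=10.0, gap_ext=0.5):
    """Best local alignment score with affine gap penalties (Gotoh recurrences)."""
    neg_inf = float("-inf")
    cols = len(target) + 1
    h_prev = [0.0] * cols        # best score ending at (i-1, j)
    f_prev = [neg_inf] * cols    # best score ending in a gap along the query axis
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * cols
        f_cur = [neg_inf] * cols
        e = neg_inf              # best score ending in a gap along the target axis
        for j in range(1, cols):
            # Extend an existing gap or open a new one, whichever scores higher.
            e = max(e - gap_ext, h_cur[j - 1] - gap_open - gap_ext)
            f_cur[j] = max(f_prev[j] - gap_ext, h_prev[j] - gap_open - gap_ext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


print(smith_waterman_score("AAAAACCCCC", "AAAAAGGGCCCCC"))
```

In the usage line, the best alignment keeps all ten matches and pays for one gap of length three, so long gaps cost little more than short ones, which is the point of the small extension penalty.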



Database : Published_Applications_AA_Main:
1: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
3: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US09_PUBCOMB.pep:*
4: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
5: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
6: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US11_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
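
The "Query Match" column in the summaries is not defined in the report, but its values are consistent with each hit's score expressed as a percentage of the perfect score (5164 for this query). The sketch below illustrates that relationship; the function name is hypothetical and the formula is inferred from the printed values.

```python
def query_match_percent(score, perfect_score):
    """Hit score as a percentage of the query's self-alignment (perfect) score."""
    return 100.0 * score / perfect_score


for score in (5164, 5156, 5121.5, 2480.5):
    print(score, round(query_match_percent(score, 5164), 1))
```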

SUMMARIES 



Result            Query
   No.    Score   Match  Length  DB  ID                    Description

    1    5164     100.0    950    4  US-10-763-210-1       Sequence 1, Appli
    2    5156      99.8    950    3  US-09-741-151-2       Sequence 2, Appli
    3    5156      99.8    950    3  US-09-965-631-4       Sequence 4, Appli
    4    5156      99.8    950    4  US-10-391-364-77      Sequence 77, Appl
    5    5156      99.8    950    5  US-10-753-267-56      Sequence 56, Appl
    6    5156      99.8    950    5  US-10-961-020-4       Sequence 4, Appli
    7    5121.5    99.2    978    4  US-10-275-107-59      Sequence 59, Appl
    8    5111      99.0    952    4  US-10-311-035-11      Sequence 11, Appl
    9    4915      95.2    924    4  US-10-093-463-28      Sequence 28, Appl
   10    4242.5    82.2    823    4  US-10-163-316-2       Sequence 2, Appli
   11    3957.5    76.6    755    4  US-10-093-463-30      Sequence 30, Appl
   12    2482.5    48.1    950    3  [illegible]           Sequence [illegible], Appli
   13    2482.5    48.1    [illegible]  [illegible]  US-10-381-793-3  Sequence 3, Appli
   14    2482.5    48.1    968    4  US-10-163-316-7       Sequence 7, Appli
   15    2482.5    48.1    968    4  US-10-391-364-82      Sequence 82, Appl
   16    2480.5    48.0    931    3  US-09-741-151-4       Sequence 4, Appli
   17    2480.5    48.0    949    5  US-10-667-281-2       Sequence 2, Appli
   18    2480.5    48.0    950    3  US-09-373-658-2       Sequence 2, Appli
   19    2480.5    48.0    950    3  US-09-989-687-2       Sequence 2, Appli
   20    2480.5    48.0    950    5  US-10-741-600-1605    Sequence 1605, Ap
   21    2480.5    48.0    950    5  US-10-973-858-14      Sequence 14, Appl
   22    2480.5    48.0    967    4  US-10-105-929-2       Sequence 2, Appli
   23    2480.5    48.0    967    4  US-10-115-286-2       Sequence 2, Appli
   24    2480.5    48.0    967    4  US-10-757-450-2       Sequence 2, Appli
   25    2480.5    48.0    [...]  5  US-10-741-600-1603    Sequence 1603, Ap
   26    2480.5    48.0    [...]  5  US-10-[...]-600-1604  Sequence 1604, Ap
   27    2480.5    48.0    967    5  US-10-923-035-37      Sequence 37, Appl
   28    2480.5    48.0    967    5  US-10-756-149-5619    Sequence 5619, Ap
   29    2480.5    48.0    967    5  US-10-757-450-2       Sequence 2, Appli
   30    2480.5    48.0    967    5  US-10-995-561-979     Sequence 979, App
   31    2480.5    48.0    967    5  US-10-995-561-980     Sequence 980, App
   32    2480.5    48.0    967    6  US-11-169-041-191     Sequence 191, App
   33    2480.5    48.0    968    3  US-09-373-658-125     Sequence 125, App
   34    2480.5    48.0    999    5  US-10-821-234-1251    Sequence 1251, Ap
   35    2476.5    48.0    967    4  US-10-755-889-134     Sequence 134, App
   36    2471.5    47.9    967    3  US-09-989-687-126     Sequence 126, App
   37    2458.5    47.6    967    6  US-11-111-953-179     Sequence 179, App
   38    2274      44.0    727    3  US-09-445-023A-1      Sequence 1, Appli
   39    2274      44.0    727    4  US-10-097-597-1       Sequence 1, Appli
   40    2274      44.0    727    4  US-10-097-580-1       Sequence 1, Appli
   41    2273      44.0    727    3  US-09-445-023A-12     Sequence 12, Appl
   42    2273      44.0    727    4  US-10-097-597-12      Sequence 12, Appl
   43    2273      44.0    727    4  US-10-097-580-12      Sequence 12, Appl
   44    2135.5    41.4    890    3  US-09-373-658-4       Sequence 4, Appli
   45    2135.5    41.4    890    3  US-09-989-687-4       Sequence 4, Appli



ALIGNMENTS 



RESULT 1 
US-10-763-210-1 

Sequence 1, Application US/10763210 
Publication No. US20040142445A1 
GENERAL INFORMATION: 
APPLICANT: Yamanouchi Pharmaceutical Co., Ltd. 
APPLICANT: Kazusa DNA Research Institute 

TITLE OF INVENTION: NOVEL METALLOPROTEASE HAVING AGGRECANASE ACTIVITY 
FILE REFERENCE: Q67541 

CURRENT APPLICATION NUMBER: US/10/763,210 
CURRENT FILING DATE: 2004-01-26 
PRIOR APPLICATION NUMBER: US/10/009,332 
PRIOR FILING DATE: 2001-12-10 
PRIOR APPLICATION NUMBER: JPA Hei 11-321740 
PRIOR FILING DATE: 1999-11-11 
PRIOR APPLICATION NUMBER: JPA 2000-144020 
PRIOR FILING DATE: 2000-05-16 
NUMBER OF SEQ ID NOS: 35 
SOFTWARE: PatentIn version 3.1 
SEQ ID NO 1 
LENGTH: 950 
TYPE: PRT

ORGANISM: Homo sapiens 
US-10-763-210-1 

Query Match 100.0%; Score 5164; DB 4; Length 950; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 950; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy      1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
          ||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950





RESULT 2 
US-09-741-151-2 

; Sequence 2, Application US/09741151
; Publication No. US20020086400A1
; GENERAL INFORMATION:
; APPLICANT: ZHU, Shiaoping et al
; TITLE OF INVENTION: ISOLATED HUMAN PROTEASE PROTEINS,
; TITLE OF INVENTION: NUCLEIC ACID MOLECULES ENCODING HUMAN PROTEASE PROTEINS, AND
; TITLE OF INVENTION: USES THEREOF
; FILE REFERENCE: CL001005
; CURRENT APPLICATION NUMBER: US/09/741,151
; CURRENT FILING DATE: 2000-12-08
; NUMBER OF SEQ ID NOS: 4
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 2
; LENGTH: 950
; TYPE: PRT
; ORGANISM: Human
US-09-741-151-2

Query Match 99.8%; Score 5156; DB 3; Length 950;

Best Local Similarity 99.9%; Pred. No. 0;

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy      1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
          |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db      1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
          ||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950





RESULT 3 
US-09-965-631-4 

; Sequence 4, Application US/09965631
; Patent No. US20020115842A1
; GENERAL INFORMATION:
APPLICANT: Friddle, Carl Johan
APPLICANT: Hilbun, Erin
TITLE OF INVENTION: Novel Human Proteases and Polynucleotides Encoding the Same
FILE REFERENCE: LEX-0241-USA
CURRENT APPLICATION NUMBER: US/09/965,631
CURRENT FILING DATE: 2001-09-27
PRIOR APPLICATION NUMBER: US 60/236,689
PRIOR FILING DATE: 2000-09-
NUMBER OF SEQ ID NOS: 7
SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 4
LENGTH: 950
TYPE: PRT
ORGANISM: homo sapiens
US-09-965-631-4 

Query Match 99.8%; Score 5156; DB 3; Length 950; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy      1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
          |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db      1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
          ||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950



RESULT 4 

US-10-391-364-77 

Sequence 77, Application US/10391364 
Publication No. US20040121349A1 
GENERAL INFORMATION: 



APPLICANT: Millennium Pharmaceuticals, Inc.
APPLICANT: Meyers, Rachel E.
APPLICANT: Carroll, Joseph M.
APPLICANT: Cook, William James
APPLICANT: Kapeller-Libermann, Rosana
APPLICANT: Welch, Nadine S.
APPLICANT: Bandaru, Rajasekhar
TITLE OF INVENTION: NOVEL 27877, 18080, 14081, 32140, 50352,
TITLE OF INVENTION: 16658, 14223, 16002, 50566, 65552 AND 65577 MOLECULES AND
TITLE OF INVENTION: USES THEREFOR
FILE REFERENCE: MPI03-019OMNIM 
CURRENT APPLICATION NUMBER: US/10/391,364 
CURRENT FILING DATE: 2003-03-18 
PRIOR APPLICATION NUMBER: US 09/950,370 
PRIOR FILING DATE: 2001-09-10 
PRIOR APPLICATION NUMBER: US 60/231,084 
PRIOR FILING DATE: 2000-09-08 
PRIOR APPLICATION NUMBER: US 10/294,039 
PRIOR FILING DATE: 2002-11-13 
PRIOR APPLICATION NUMBER: US 60/338,587 
PRIOR FILING DATE: 2001-11-13 
PRIOR APPLICATION NUMBER: US 10/266,035 
PRIOR FILING DATE: 2002-10-07 
PRIOR APPLICATION NUMBER: US 60/328,198 
PRIOR FILING DATE: 2001-10-09 
PRIOR APPLICATION NUMBER: US 09/717,926 
PRIOR FILING DATE: 2000-11-21 
PRIOR APPLICATION NUMBER: US 60/214,707 
PRIOR FILING DATE: 2000-06-27 
PRIOR APPLICATION NUMBER: US 10/268,036 
PRIOR FILING DATE: 2002-10-09 
PRIOR APPLICATION NUMBER: US 60/327,820 
PRIOR FILING DATE: 2001-10-09 

Remaining Prior Application data removed - See File Wrapper or PALM. 

NUMBER OF SEQ ID NOS: 93

SOFTWARE: FastSEQ for Windows Version 4.0

SEQ ID NO 77 
LENGTH: 950 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-391-364-77 

Query Match 99.8%; Score 5156; DB 4; Length 950; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy      1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
          |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db      1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
          ||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950



RESULT 5 

US-10-753-267-56 

Sequence 56, Application US/10753267 
Publication No. US20050037946A1 
GENERAL INFORMATION: 



APPLICANT: Millennium Pharmaceuticals, Inc.
APPLICANT: Stagliano, Nancy E.
APPLICANT: Healy, Aileen
APPLICANT: Acton, Susan L.
APPLICANT: Galvin, Katherine M.
APPLICANT: Donoghue, Mary A.
APPLICANT: Rodrigue-Way, Amelie
APPLICANT: Tomlinson, James E.
TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TREATING
TITLE OF INVENTION: CARDIOVASCULAR DISEASE USING 1722, 10280, 59917, 85553,
TITLE OF INVENTION: 10653, 9235, 21668, 17794, 2210, 6169, 10102, 21061, 17662,
TITLE OF INVENTION: 1468, 12282, 6350, 9035, 1820, 23652, 7301, 8925, 8701,
TITLE OF INVENTION: 3533, 9462, 9123, 12788, 17729, 65552, 1261, 21476, 33770,
TITLE OF INVENTION: 9380, 2569654, 33556, 53656, 44143, 32612, 10671, 261,
TITLE OF INVENTION: 44570, 41922, 2552, 2417, 19319, 43969, 8921, 8993, 955,
TITLE OF INVENTION: 32345, 966, 1920, 17318, 1510, 14180, 26005, 554, 16408,
TITLE OF INVENTION: 42028, 112091, 13886, 13942, 1673, 54946 OR 2419
FILE REFERENCE: MPI03-003P1RNOMNIM 
CURRENT APPLICATION NUMBER: US/10/753,267 
CURRENT FILING DATE: 2004-01-08 
PRIOR APPLICATION NUMBER: US 60/439,683 
PRIOR FILING DATE: 2003-01-13 
PRIOR APPLICATION NUMBER: US 60/445,216 
PRIOR FILING DATE: 2003-02-05 
PRIOR APPLICATION NUMBER: US 60/448,036 
PRIOR FILING DATE: 2003-02-18 
PRIOR APPLICATION NUMBER: US 60/454,189 
PRIOR FILING DATE: 2003-03-12 
PRIOR APPLICATION NUMBER: US 60/457,541 
PRIOR FILING DATE: 2003-03-25 
PRIOR APPLICATION NUMBER: US 60/466,411 
PRIOR FILING DATE: 2003-04-29 
PRIOR APPLICATION NUMBER: US 60/469,041 
PRIOR FILING DATE: 2003-05-08 
PRIOR APPLICATION NUMBER: US 60/477,414 
PRIOR FILING DATE: 2003-06-10 
PRIOR APPLICATION NUMBER: US 60/478,560 
PRIOR FILING DATE: 2003-06-13 
PRIOR APPLICATION NUMBER: US 60/489,772 
PRIOR FILING DATE: 2003-07-24 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 130

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 56 
LENGTH: 950 
TYPE: PRT 

ORGANISM: Homo Sapiens 
US-10-753-267-56 

Query Match 99.8%; Score 5156; DB 5; Length 950; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 949; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy      1 MLLLGILTLAFAGRTAGGFEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
          |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db      1 MLLLGILTLAFAGRTAGGSEPEREVVVPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLVVADESMVKFHGADLEH 240

Qy    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 YLLTLLATAARLYRHPSILNPINIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVVDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFVVAIPAGASSIDIRQRGYKGLI 720

Qy    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GDDNYLALKNSQGKYLLNGHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
          ||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950



RESULT 6 
US-10-961-020-4
Sequence 4, Application US/10961020
Publication No. US20050090657A1
GENERAL INFORMATION: 
APPLICANT: Friddle, Carl Johan 
APPLICANT: Hilbun, Erin 

TITLE OF INVENTION: Novel Human Proteases and Polynucleotides Encoding the Same 
FILE REFERENCE: LEX-0241-USA 
CURRENT APPLICATION NUMBER: US/10/961,020
CURRENT FILING DATE: 2004-10-08
PRIOR APPLICATION NUMBER: US/09/965,631
PRIOR FILING DATE: 2001-09-27
PRIOR APPLICATION NUMBER: US 60/236,689
PRIOR FILING DATE: 2000-09-29
NUMBER OF SEQ ID NOS: 7
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 4 
LENGTH: 950 
TYPE: PRT 

ORGANISM: homo sapiens 
US-10-961-020-4 



Query Match 99.8%; Score 5156; DB 5; Length 950;
Best Local Similarity 99.9%; Pred. No. 0;
Matches 949; Conservative 0; Mismatches; Indels; Gaps;



Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950

RESULT 7 

US-10-275-107-59 

Sequence 59, Application US/10275107 
Publication No. US20040063107A1 
GENERAL INFORMATION: 



APPLICANT: PLOWMAN, GREGORY D.
APPLICANT: WHYTE, DAVID
APPLICANT: SUDARSANAM, SUCHA
APPLICANT: MANNING, GERARD
APPLICANT: CAENEPEEL, SEAN R.
APPLICANT: PAYNE, VILLA
TITLE OF INVENTION: NOVEL PROTEASES 
FILE REFERENCE: 038602/1479 
CURRENT APPLICATION NUMBER: US/10/275,107 
CURRENT FILING DATE: 2003-11-03 
PRIOR APPLICATION NUMBER: PCT/US01/14431
PRIOR FILING DATE: 2001-05-04
PRIOR APPLICATION NUMBER: 60/201,879
PRIOR FILING DATE: 2000-05-04

NUMbER* OF SEQ"id'"nOSTT.05 * 

SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 59 
LENGTH: 978 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-275-107-59 

Query Match 99.2%; Score 5121.5; DB 4; Length 978; 

Best Local Similarity 99.4%; Pred. No. 0; 

Matches 947; Conservative 0; Mismatches 3; Indels 3; Gaps 2; 

Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db  26 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 85

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 145

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 205

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 265

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 YLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 325

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 385

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 445

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 446 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 505

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 506 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 565

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 566 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 625

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPK-WDGTLCSPDSTSVCVQGKCIKAGC 659
       |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Db 626 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVWDGTLCSPDSTSVCVQGKCIKAGC 685

Qy 660 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 719
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 686 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 745

Qy 720 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 779
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 746 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 805

Qy 780 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPR--GPSVLHNSVLSLSNQVEQPDD 837
       |||||||||||||||||||||||||||||||||  ||  |||||||||||||||||||||
Db 806 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPPHPRGGGPSVLHNSVLSLSNQVEQPDD 865

Qy 838 RPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTW 897
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 866 RPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTW 925

Qy 898 ELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 926 ELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 978



RESULT 8 
US-10-311-035-11 

Sequence 11, Application US/10311035 
Publication No. US20040023243A1 
GENERAL INFORMATION: 



APPLICANT: INCYTE GENOMICS, INC.
APPLICANT: YUE, Henry
APPLICANT: ELLIOTT, Vicki
APPLICANT: GANDHI, Ameena R.
APPLICANT: LAL, Preeti
APPLICANT: AU-YOUNG, Janice
APPLICANT: TRIBOULEY, Catherine M.
APPLICANT: DELEGEANE, Angelo M.
APPLICANT: BAUGHN, Mariah R.
APPLICANT: NGUYEN, Danniel B.
APPLICANT: LEE, Ernestine A.
APPLICANT: HAFALIA, April
APPLICANT: KHAN, Farrah A.
APPLICANT: CHAWLA, Narinder K.
APPLICANT: YAO, Monique G.
APPLICANT: LU, Dyung Aina M.
APPLICANT: ARVIZU, Chandra S.
APPLICANT: TANG, Y. Tom
APPLICANT: WALSH, Roderick T.
APPLICANT: AZIMZAI, Yalda
APPLICANT: LU, Yan
APPLICANT: RAMKUMAR, Jayalaxmi
APPLICANT: XU, Yuming
APPLICANT: REDDY, Roopa
APPLICANT: DAS, Depopriya
APPLICANT: KEARNEY, Liam
APPLICANT: KALLICK, Deborah A.
TITLE OF INVENTION: Proteases
FILE REFERENCE: PI-0123 PCT
CURRENT APPLICATION NUMBER: US/10/311,035
CURRENT FILING DATE: 2002-12-10
PRIOR APPLICATION NUMBER: 60/212,336; 60/213,955; 60/215,396; 60/216,821; 60/218,946
PRIOR FILING DATE: 2000-06-16; 2000-06-22; 2000-06-29; 2000-07-07; 2000-07-14
NUMBER OF SEQ ID NOS: 45
SOFTWARE: PERL Program
SEQ ID NO 11
LENGTH: 952
TYPE: PRT
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: misc_feature
OTHER INFORMATION: Incyte ID No. US20040023243A1 7473089CD1
US-10-311-035-11

Query Match; Score 5111; DB 4; Length 952;
Best Local Similarity 99.4%; Pred. No. 0;
Matches 946; Conservative 0; Mismatches 4; Indels 2; Gaps;




Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIWVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPK-WDGTLCSPDSTSVCVQGKCIKAGC 659
       |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Db 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKVWDGTLCSPDSTSVCVQGKCIKAGC 660

Qy 660 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 719
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 DGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGL 720

Qy 720 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 779
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 IGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPL 780

Qy 780 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPR-GPSVLHNSVLSLSNQVEQPDDR 838
       |||||||||||||||||||||||||||||||||  || ||||||||||||||||||||||
Db 781 TVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPPHPRGGPSVLHNSVLSLSNQVEQPDDR 840

Qy 839 PPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWE 898
       ||||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||
Db 841 PPARWVAGSWGPCSASCGSGLQKRAVDWRGSAGQRTVPACDAAHRPVETQACGEPCPTWE 900

Qy 899 LSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 LSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 952

RESULT 9 

US-10-093-463-28 

; Sequence 28, Application US/10093463 

; Publication No. US20030208039A1 

; GENERAL INFORMATION: 

; APPLICANT: Padigaru, Muralidhara

APPLICANT: Shenoy, Suresh 

APPLICANT: Kekuda, Ramesh 
; APPLICANT: Gusev, Vladimir 



APPLICANT: Pochart, Pascal 
APPLICANT: Zhong, Mei 
APPLICANT: Rastelli, Luca 
APPLICANT: Mezes, Peter
APPLICANT: Smithson, Glennda
APPLICANT: Guo, Xiaojia 
APPLICANT: Gerlach, Valerie 
APPLICANT: Gasman, Stacie 
APPLICANT: Boldog, Ferenc 
APPLICANT: Li, Li 
APPLICANT: Zerhusen, Bryan 
APPLICANT: Tchernev, Velizar 
APPLICANT: Gangolli, Esha 
APPLICANT: Vernet, Corine
APPLICANT: Pena, Carol 
APPLICANT: Burgess, Catherine 
APPLICANT: Liu, Xiaohong 
APPLICANT: Spytek, Kimberly 
APPLICANT: Gorman, Linda 
APPLICANT: Spaderna, Steven 
APPLICANT: Voss, Edward
APPLICANT: Malyankar, Uriel 
APPLICANT: Anderson, David 
APPLICANT: Patturajan, Meera 
APPLICANT: Miller, Charles 
APPLICANT: Taupier, Raymond J. Jr. 

TITLE OF INVENTION: No. US20030208039Alel Antibodies that Bind to Antigenic Polypeptides, Nucleic Acids
TITLE OF INVENTION: Encoding The Antigens, and Methods of Use.
FILE REFERENCE: 21402-290A (Cura 590AT)
CURRENT APPLICATION NUMBER: US/10/093,463
CURRENT FILING DATE: 2002-06-24
PRIOR APPLICATION NUMBER: 60/283,675
PRIOR FILING DATE: 2001-04-14
PRIOR APPLICATION NUMBER: 60/338,092
PRIOR FILING DATE: 2001-12-03
PRIOR APPLICATION NUMBER: 60/274,281
PRIOR FILING DATE: 2001-03-08
PRIOR APPLICATION NUMBER: 60/274,101
PRIOR FILING DATE: 2001-03-08
PRIOR APPLICATION NUMBER: 60/325,681
PRIOR FILING DATE: 2001-09-27
PRIOR APPLICATION NUMBER: 60/304,354
PRIOR FILING DATE: 2001-07-10
PRIOR APPLICATION NUMBER: 60/279,995
PRIOR FILING DATE: 2001-03-30
PRIOR APPLICATION NUMBER: 60/294,899
PRIOR FILING DATE: 2001-05-31
PRIOR APPLICATION NUMBER: 60/287,424
PRIOR FILING DATE: 2001-04-30
PRIOR APPLICATION NUMBER: 60/299,027
PRIOR FILING DATE: 2001-06-18
PRIOR APPLICATION NUMBER: 60/309,198
PRIOR FILING DATE: 2001-07-31
PRIOR APPLICATION NUMBER: 60/281,194
PRIOR FILING DATE: 2001-04-04
PRIOR APPLICATION NUMBER: 60/274,194
PRIOR FILING DATE: 2001-03-08
PRIOR APPLICATION NUMBER: 60/274,849
PRIOR FILING DATE: 2001-03-09
PRIOR APPLICATION NUMBER: 60/330,380
PRIOR FILING DATE: 2001-10-18
PRIOR APPLICATION NUMBER: 60/275,235
PRIOR FILING DATE: 2001-03-12
PRIOR APPLICATION NUMBER: 60/288,342
PRIOR FILING DATE: 2001-05-03
PRIOR APPLICATION NUMBER: 60/275,578
PRIOR FILING DATE: 2001-03-13
NUMBER OF SEQ ID NOS: 370
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 28
LENGTH: 924
TYPE: PRT
ORGANISM: Homo sapiens
US-10-093-463-28 



Query Match 95.2%; Score 4915; DB 4; Length 924; 

Best Local Similarity 95.8%; Pred. No. 0; 

Matches 913; Conservative 2; Mismatches 6; Indels 32; Gaps 3; 



Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKH---RVDGSWAKWDPYGPCSRTCGGGVQ 537
       |||||||||||||||||||||||||||||||||   :|||||||||||||||||||||||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHSSSQVDGSWAKWDPYGPCSRTCGGGVQ 540

Qy 538 LARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRL 597
       |||||||||||||||||||||||||||||||||||  |||||||||||||||||||||||
Db 541 LARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPS--SGKSFREEQCEAFNGYNHSTNRL 598

Qy 598 TLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKA 657
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 599 TLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKA 658

Qy 658 GCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYK 717
       ||||||||||||||||||||||||||||||| :   ||||||||||||||||||||||||
Db 659 GCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLLSPARHGYNFWAIPAGASSIDIRQRGYK 718

Qy 718 GLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILE 777
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 719 GLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILE 778

Qy 778 PLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDD 837
       ||||||||||||||||||||||||||||||||||
Db 779 PLTVEVLSVGKMTPPRVRYSFYLPKEPREDKSSH-------------------------- 812

Qy 838 RPPARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTW 897
        ||||||||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db 813 -PPARWVAGSWGPCSASCGSGLQKRAVDWRGSAGQRTVPACDAAHRPVETQACGEPCPTW 871

Qy 898 ELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 872 ELSAWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 924



RESULT 10 
US-10-163-316-2 

Sequence 2, Application US/10163316 
Publication No. US20020197703A1 
GENERAL INFORMATION: 
APPLICANT: Kapeller-Libermann, Rosana 

TITLE OF INVENTION: 65552. A Human Matrix Metalloproteinase and Uses 
TITLE OF INVENTION: Therefor 
FILE REFERENCE: MPI01-025P1RNM 
CURRENT APPLICATION NUMBER: US/10/163,316 
CURRENT FILING DATE: 2002-06-05 
PRIOR APPLICATION NUMBER: 60/297,863 
PRIOR FILING DATE: 2001-06-13 
NUMBER OF SEQ ID NOS: 10 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2 
LENGTH: 823 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-163-316-2 

Query Match 82.2%; Score 4242.5; DB 4; Length 823; 

Best Local Similarity 96.9%; Pred. No. 0; 

Matches 791; Conservative 0; Mismatches 2; Indels 23; Gaps 1; 

Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGGVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHR-----------------------VDG 517
       ||||||||||||||||||||||||||||||||||                       |||
Db 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRPPTDIISPKQLLLRLPNGLHTTQVDG 540

Qy 518 SWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGK 577
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 SWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGK 600

Qy 578 SFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWD 637
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 SFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWD 660

Qy 638 GTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYN 697
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 GTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYN 720

Qy 698 FWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLL 757
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 FWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLL 780

Qy 758 RYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPR 793
       ||||||||||||||||||||||||||||||||||||
Db 781 RYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPR 816



RESULT 11 
US-10-093-463-30 

Sequence 30, Application US/10093463 
Publication No. US20030208039A1 
GENERAL INFORMATION: 



APPLICANT: Padigaru, Muralidhara
APPLICANT: Shenoy, Suresh
APPLICANT: Kekuda, Ramesh
APPLICANT: Gusev, Vladimir
APPLICANT: Pochart, Pascal
APPLICANT: Zhong, Mei
APPLICANT: Rastelli, Luca
APPLICANT: Mezes, Peter
APPLICANT: Smithson, Glennda
APPLICANT: Guo, Xiaojia
APPLICANT: Gerlach, Valerie
APPLICANT: Gasman, Stacie
APPLICANT: Boldog, Ferenc
APPLICANT: Li, Li
APPLICANT: Zerhusen, Bryan
APPLICANT: Tchernev, Velizar
APPLICANT: Gangolli, Esha
APPLICANT: Vernet, Corine
APPLICANT: Pena, Carol
APPLICANT: Burgess, Catherine
APPLICANT: Liu, Xiaohong
APPLICANT: Spytek, Kimberly
APPLICANT: Gorman, Linda
APPLICANT: Spaderna, Steven
APPLICANT: Voss, Edward
APPLICANT: Malyankar, Uriel
APPLICANT: Anderson, David
APPLICANT: Patturajan, Meera
APPLICANT: Miller, Charles
APPLICANT: Taupier, Raymond J. Jr.
TITLE OF INVENTION: No. US20030208039Alel Antibodies that Bind to Antigenic Polypeptides, Nucleic Acids
TITLE OF INVENTION: Encoding The Antigens, and Methods of Use.
FILE REFERENCE: 21402-290A (Cura 590AT)
CURRENT APPLICATION NUMBER: US/10/093,463
CURRENT FILING DATE: 2002-06-24
PRIOR APPLICATION NUMBER: 60/283,675
PRIOR FILING DATE: 2001-04-14
PRIOR APPLICATION NUMBER: 60/338,092
PRIOR FILING DATE: 2001-12-03
PRIOR APPLICATION NUMBER: 60/274,281
PRIOR FILING DATE: 2001-03-08
PRIOR APPLICATION NUMBER: 60/274,101
PRIOR FILING DATE: 2001-03-08
PRIOR APPLICATION NUMBER: 60/325,681
PRIOR FILING DATE: 2001-09-27
PRIOR APPLICATION NUMBER: 60/304,354
PRIOR FILING DATE: 2001-07-10
PRIOR APPLICATION NUMBER: 60/279,995
PRIOR FILING DATE: 2001-03-30
PRIOR APPLICATION NUMBER: 60/294,899
PRIOR FILING DATE: 2001-05-31

; PRIOR APPLICATION NUMBER: 60/287,424

; PRIOR FILING DATE: 2001-04-30 

; PRIOR APPLICATION NUMBER: 60/299,027 

; PRIOR FILING DATE: 2001-06-18 

; PRIOR APPLICATION NUMBER: 60/309,198 

; PRIOR FILING DATE: 2001-07-31 

; PRIOR APPLICATION NUMBER: 60/281,194 

; PRIOR FILING DATE: 2001-04-04 

; PRIOR APPLICATION NUMBER: 60/274,194 

; PRIOR FILING DATE: 2001-03-08 

; PRIOR APPLICATION NUMBER: 60/274,849 

; PRIOR FILING DATE: 2001-03-09 

; PRIOR APPLICATION NUMBER: 60/330,380 

; PRIOR FILING DATE: 2001-10-18 

; PRIOR APPLICATION NUMBER: 60/275,235 

; PRIOR FILING DATE: 2001-03-12 

; PRIOR APPLICATION NUMBER: 60/288,342 

; PRIOR FILING DATE: 2001-05-03 

; PRIOR APPLICATION NUMBER: 60/275,578 

; PRIOR FILING DATE: 2001-03-13 

; NUMBER OF SEQ ID NOS: 370

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 30 

LENGTH: 755 

TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-093-463-30 



Query Match 76.6%; Score 3957.5; DB 4; Length 755;

Best Local Similarity 79.4%; Pred. No. 0;

Matches 754; Conservative 0; Mismatches 1; Indels 195; Gaps 1;



Qy   1 MLLLGILTLAFAGRTAGGFEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60
       |||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1 MLLLGILTLAFAGRTAGGSEPEREVWPIRLDPDINGRRYYWRGPEDSGDQGLIFQITAF 60

Qy  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGSSDLRRCFYSGDVNAEPDSFAAVSLC 120

Qy 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GGLRGAFGYRGAEYVISPLPNASAPAAQRNSQGAHLLQRRGVPGGPSGDPTSRCGVASGW 180

Qy 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPAILRALDPYKPRRAGFGESRSRRRSGRAKRFVSIPRYVETLWADESMVKFHGADLEH 240

Qy 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 YLLTLLATAARLYRHPSILNPINIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLN 300

Qy 301 KVSDKHPEYWDTAILFTRQDLCGATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTA 360
       |||||||||||||||||||
Db 301 KVSDKHPEYWDTAILFTRQ----------------------------------------- 319

Qy 361 HELGHVFNMPHDNVKVCEEVFGKLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHG 420

Db 320 ------------------------------------------------------------ 319

Qy 421 DCLLDQPSKPISLPEDLPGASYTLSQQCELAFGVGSKPCPYMQYCTKLWCTGKAKGQMVC 480

Db 320 ------------------------------------------------------------ 319

Qy 481 QTRHFPWADGTSCGEGKLCLKGACVERHNLNKHRVDGSWAKWDPYGPCSRTCGGGVQLAR 540
                                         ||||||||||||||||||||||||||
Db 320 ----------------------------------VDGSWAKWDPYGPCSRTCGGGVQLAR 345

Qy 541 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 346 RQCTNPTPANGGKYCEGVRVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLA 405

Qy 601 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 406 VAWVPKYSGVSPRDKCKLICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCD 465

Qy 661 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 466 GNLGSKKRFDKCGVCGGDNKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLI 525

Qy 721 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 526 GDDNYLALKNSQGKYLLNGHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLT 585

Qy 781 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 586 VEVLSVGKMTPPRVRYSFYLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPP 645

Qy 841 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 646 ARWVAGSWGPCSASCGSGLQKRAVDCRGSAGQRTVPACDAAHRPVETQACGEPCPTWELS 705

Qy 901 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 950
       ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 706 AWSPCSKSCGRGFQRRSLKCVGHGGRLLARDQCNLHRKPQELDFCVLRPC 755



RESULT 12 
US-09-321-987B-4 

; Sequence 4, Application US/09321987B 
; Patent No. US20020102210A1 
; GENERAL INFORMATION: 

APPLICANT: Kimble, Judith E 

APPLICANT: Blelloch, Robert H 

TITLE OF INVENTION: Agent and Method for Modulating Cell Migration 

; FILE REFERENCE: 9602 96.953 86 

; CURRENT APPLICATION NUMBER: US/09/321,987B 

; CURRENT FILING DATE: 1999-05-28 

; PRIOR APPLICATION NUMBER: 60/087,170 

; PRIOR FILING DATE: 1998-05-29 

; PRIOR APPLICATION NUMBER: 60/129,023 

; PRIOR FILING DATE: 1999-04-13 

; NUMBER OF SEQ ID NOS : 5 

SOFTWARE: PatentIn Ver. 2.1 
; SEQ ID NO 4 
; LENGTH: 950 

TYPE : PRT 

ORGANISM: Murine 
US-09-321-987B-4 



Query Match 48.1%; Score 2482.5; DB 3; Length 950; 

Best Local Similarity 48.0%; Pred. No. 2.6e-199; 

Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 

MLLLG I LTLAF AGRTAGG - - FE PERE VW P I RLDPD I NGRR Y YWRGP - EDSGDQGLI FQI 5 7 

:||| :|: I I I I = hlH h 11111: = 

LLLLASITMLLCARGAHGRPTEEDEELVLP~SLE RAPGHDSTTTRL— RL 66 

TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNA 109 

II : :|| l|: III! h : ^ MM II IHH H 

DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 121 

EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161 

:| I Ihlll hllll :||: II I : III I hhil 



Qy 


1 


Db 


20 


Qy 


58 


Db 


67 


Qy 


110 


Db 


122 


Qy 


162 


Db 


181 


Qy 


207 



3PSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206 

I ::||l I I W-- I 

-RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 226 



I I III!! Illllh:|lhll III: hllllll : III hllll I I 



Db 227 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 286 



Qy 263 NIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||||:|:: : Ihll llllllllll llh I l|: I I I I I I I I I I I I I I 
Db 287 SLVWKIIiVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 346 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382 

|: lllllllllllhlll lllllllllll :|||||||lllllllll|: \ \ \ 
Db 347 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 406 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

:|:|: I :| : III III ::| I I I : I I I : I I : h I II II Mil I 
Db 407 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 466 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

::1|: II II II II I I I I I : I : I I I I : I I 1 I I I I I I I I I M I : 

Db 467 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 526 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

I II : II I III I M llllllllll hi M I IMIIIII 

Db 527 GKCVNKTDM-KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 585 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

||:|lll|:| II : ^Ihllllllll I II I I I : I I I I = I : II I 

Db 586 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 644 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I llhll lllllll llllillllllhhillll : llhMllllllh 

Db 645 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 704 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:|lt::|: I Ih :|llll|::h:: I =1 : : = II : : : II : II 

Db 705 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 764 

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

hi :| :|:M ll-llllh |:| h llll-ll II I--- 

Db 765 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKFTY 824 

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: |: |:: : I II II II Hill 

Db 825 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 857 

Qy 859 LQKRAVDCRGSAGQRTVPACDA7^--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

hi I II I II : I :| h I : III \ lllllhlhh:: 

Db 858 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 914 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

hllll II :h : h :th =111 1 I 
Db 915 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 950 
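
The "Best Local Similarity" figure in the summary lines above is consistent with the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). The short Python sketch below checks that reading against the reported counts. The function name is hypothetical, and the formula is an assumption inferred from the printed numbers, not something taken from the GenCore documentation.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are identical matches.

    Assumption: every alignment column falls into exactly one of the
    four reported categories, so their sum is the alignment length.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts reported for this result:
# Matches 478; Conservative 158; Mismatches 249; Indels 111
similarity = best_local_similarity(478, 158, 249, 111)
print(f"Best Local Similarity {similarity:.1f}%")
```

With the counts reported for this result the alignment spans 996 columns, and the computed percentage rounds to the 48.0% printed in the report.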



RESULT 13 
US-10-381-793-3 

; Sequence 3, Application US/10381793 
; Publication No. US20040091965A1 
; GENERAL INFORMATION: 
; APPLICANT: Bayer AG 

; TITLE OF INVENTION: REGULATION OF HUMAN ADAM-TS-LIKE PROTEIN 

; FILE REFERENCE: LI0152 Foreign Countries 

; CURRENT APPLICATION NUMBER: US/10/381,793 

; CURRENT FILING DATE: 2003-03-28 

; PRIOR APPLICATION NUMBER: US 60/235,881 

; PRIOR FILING DATE: 2000-09-28 

; PRIOR APPLICATION NUMBER: US 60/XXX,XXX 
; PRIOR FILING DATE: 2001-07-24 
; NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: PatentIn version 3.1 
; SEQ ID NO 3 

LENGTH: 951 

TYPE: PRT 
; ORGANISM: Mus musculus 



US-10-381-793-3 



Query Match 48.1%; Score 2482.5; DB 4; Length 951; 

Best Local Similarity 48.0%; Pred. No. 2.6e-199; 

Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGG--FEPEREVWPIRLDPDINGRRYYWRGP-EDSGDQGLIFQI 57 

:|ll :|: III I : hhl h M II I 

Db 20 LLLLASITMLLCARGAHGRPTEEDEELVLP-SLE RAPGHDSTTTRL--RL 66 

Qy 58 TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNA 109 

II : :|l II: Mil h : : Mil II lllll II 

Db 67 DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 121 

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161 

:| I Ihlll hllll :| h I I I : III I hhll 

Db 122 DPGSAAALSLCEGVRGAFYLQGEEFFIQPAPGVATERLAPAVPEEESSARPQFHILRRR- 180 

Qy 162 VPGGPSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206 

I ::||l I I II:: I 

Db 181 RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 226 

Qy 207 SGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

I I lllll lllll|::|||:|| Mh hllllll = III hllll I I 
Db 227 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 286 

Qy 263 NIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||l|:|:: : Ihll llllllllll llh I Ih IhUIIIIMIIIII 
Db 287 SLWVKILVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 34 6 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382 

|: IIIIMMIIIhlll lllllllllll :|IIIIIIIIIIIMIIh I I : I 
Db 347 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 406 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

:|:|: I :| : III III ::| MhUhlhhl II II MM I 
Db 407 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 466 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

::||: II M II II Mill = I : II I I : I II I II II M I I M h 

Db 467 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 526 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

I II : II I Ml I hi IIIIIIMII hi II I IIMIIM 

Db 527 GKCVNKTDM- KHFATPVHGSWG PWGPWGDCSRTCGGGVQ YTMRECDNPVPKNGGK YCEGK 585 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

IhMMhl II ■ Mhllllllll I II MhlllhhIII 

Db 586 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 644 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I llhll IMIIM IIMIIIIIMhhIMM : MhMllllllh 

Db 645 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 704 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:||h:h I Ih :| IMM-h- I M = :Mh: = I hll 

Db 705 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 764 

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

hi :| :hll IhMlllh hi ::: h lllhMI II h::::: 
Db 765 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKFTY 824 

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: h h: : I II II M Mill 

Db 825 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 857 

Qy 859 LQKRAVDCRGSAGQRTVPACDAAH'-RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

hi III I mi M h I : III h: Mlllhlhh:: 

Db 858 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 914 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 



|:|||| I I :|: : h :||: W \ I I 
Db 915 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 950 



RESULT 14 
US-10-163-316-7 

Sequence 7, Application US/10163316 
Publication No. US20020197703A1 
GENERAL INFORMATION: 
APPLICANT: Kapeller-Libermann, Rosana 

TITLE OF INVENTION: 65552, A Human Matrix Metalloproteinase and Uses 
TITLE OF INVENTION: Therefor 
FILE REFERENCE: MPI01-025P1RNM 
CURRENT APPLICATION NUMBER: US/10/163,316 
CURRENT FILING DATE: 2002-06-05 
PRIOR APPLICATION NUMBER: 60/297,863 
PRIOR FILING DATE: 2001-06-13 
NUMBER OF SEQ ID NOS : 10 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 7 
LENGTH: 968 
TYPE: PRT 

ORGANISM: Mus musculus 
US-10-163-316-7 

Query Match 48.1%; Score 2482.5; DB 4; Length 968; 

Best Local Similarity 48.0%; Pred. No. 2.6e-199; 

Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGG--FEPEREWVPIRLDPDINGRRYYWRGP-EDSGDQGLIFQI 57 

:|M :|: III I : hhl h I I II I 

Db 37 LLLLASITMLLCARGAHGRPTEEDEELVLP-SLE RAPGHDSTTTRL— RL 83 

Qy 58 TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS SDLRRCFYSGDVNA 109 

II : :|| Ih MM |: : : I : M II Mill II 

Db 84 DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 138 

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161 

:| I Ihlll hllll :||: M I : III I IMMI 

Db 139 DPGSAAALSLCEGVRGAFYLQGEEFFIQPAPGVATERLAPAVPEEESSARPQFHILRRR- 197 

Qy 162 VPGGPSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206 

I ::||| I I M:: I 

Db 198 RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 243 

Qy 207 SGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

I I Mill MIIM::Mhll III: hllMII : Ml hllll I I 
Db 244 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 303 

Qy 263 NIVWKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||||:|:: : Ihll IIMIIMII Mh I Ih IhMIIIIIIIIIII 
Db 304 SLWVKILVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 363 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 382 

h IIMMMIIIhlll MIIIIMIM MIIIIIIIIMMIIIh \ \ ■■ \ 

Db 364 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 423 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

:hh I :| : Ml III :M llhMIMhhl II M MM I 
Db 424 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 4 83 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 5 01 

::|h M M II M MMI : I HMhMMMMMMM h 

Db 484 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 54 3 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 5 58 

I II : :: II I III I Ml MIIMMII hi II I IMIIIM 

Db 544 GKCVNKTDM-KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 602 



Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

Ihllllhl II : :||:|||lllll I II llhlllhhill 

Db 603 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 661 



Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I |||:|| lllllll lllllllllllhhillll : llhlllllllll: 
Db 662 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 721 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 738 

:|M::|: I 11= H llllh:h:: I 0 : • W I IHI 

Db 722 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 781 

Qy 739 GHFWSAVERDLWKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

|:| :i :|:|| l|::|lll|: hi = = : h lllh:|l II h::::: 
Db 782 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKFTY 841 

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

:: |: |:: : I II II II '-WW 

Db 842 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 874 

Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

hi I II I II : I :| |: I : III h: I I I I I I : I I : I : : : 

Db 875 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 931 

Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

hllll I I :i: : h :|ll I I 

Db 932 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 967 



RESULT 15 
US-10-391-364-82 

Sequence 82, Application US/10391364 
Publication No. US20040121349A1 
GENERAL INFORMATION: 



APPLICANT: Millennium Pharmaceuticals, Inc. 
APPLICANT: Meyers, Rachel E. 
APPLICANT: Carroll, Joseph M. 
APPLICANT: Cook, William James 
APPLICANT: Kapeller-Libermann, Rosana 
APPLICANT: Welch, Nadine S. 
APPLICANT: Bandaru, Rajasekhar 
TITLE OF INVENTION: NOVEL 27877, 18080, 14081, 32140, 50352, 
TITLE OF INVENTION: 16658, 14223, 16002, 50566, 65552 AND 65577 MOLECULES AND 
TITLE OF INVENTION: USES THEREFOR 
FILE REFERENCE: MPI03-019OMNIM 
CURRENT APPLICATION NUMBER: US/10/391,364 
CURRENT FILING DATE: 2003-03-18 
PRIOR APPLICATION NUMBER: US 09/950,370 
PRIOR FILING DATE: 2001-09-10 
PRIOR APPLICATION NUMBER: US 60/231,084 
PRIOR FILING DATE: 2000-09-08 
PRIOR APPLICATION NUMBER: US 10/294,039 
PRIOR FILING DATE: 2002-11-13 
PRIOR APPLICATION NUMBER: US 60/338,587 
PRIOR FILING DATE: 2001-11-13 
PRIOR APPLICATION NUMBER: US 10/266,035 
PRIOR FILING DATE: 2002-10-07 
PRIOR APPLICATION NUMBER: US 60/328,198 
PRIOR FILING DATE: 2001-10-09 
PRIOR APPLICATION NUMBER: US 09/717,926 
PRIOR FILING DATE: 2000-11-21 
PRIOR APPLICATION NUMBER: US 60/214,707 
PRIOR FILING DATE: 2000-06-27 
PRIOR APPLICATION NUMBER: US 10/268,036 
PRIOR FILING DATE: 2002-10-09 
PRIOR APPLICATION NUMBER: US 60/327,820 
PRIOR FILING DATE: 2001-10-09 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 93 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 82 
LENGTH: 968 
TYPE : PRT 



ORGANISM: Mus musculus 
US-10-391-364-82 



Query Match 48.1%; Score 2482.5; DB 4; Length 968; 

Best Local Similarity 48.0%; Pred. No. 2.6e-199; 

Matches 478; Conservative 158; Mismatches 249; Indels 111; Gaps 22; 

Qy 1 MLLLGILTLAFAGRTAGG — FEPEREVWPIRLDPDINGRRY YWRGP-EDSGDQGLIFQI 57 

:||| :|: I I I I : h I I II I 

Db 37 LLLLASITMLLCARGAHGRPTEEDEELVLP-SLE RAPGHDSTTTRL--RL 83 

Qy 58 TAFQEDFYLHLTPDAQFLAPAFSTEHLGVPLQGLTGGS- --SDLRRCFYSGDVNA 109 

II : :| I Ih Mil \- ■ • I : II II HIM M 

Db 84 DAFGQQLHLKLQPDSGFLAPGFTLQTV GRSPGSEAQHLDPTGDLAHCFYSGTVNG 138 

Qy 110 EPDSFAAVSLCGGLRGAFGYRGAEYVISPLPNAS APAAQRNSQGA HLLQRRG 161 

:| I Ihlll hllll :| h I I I : III I IMMI 

Db 139 DPGSAAALSLCEGVRGAFYLQGEEFFIQPAPGVATERLAPAVPEEESSARPQFHILRRR- 197 

Qy 162 VPGGPSGDPTSRCGVASGWNPAILRALDPYKPRRAGFGESRSRRR 206 

I ::|ll I I II:: I 

Db 198 RRGSGGAKCGVMD DETLPTSDSRPESQNTRNQWPVRDPTPQDAGKP 243 

Qy 207 SGRAKRFVSIPRYVETLWADESMVKFHGADLEHYLLTLLATAARLYRHPSILNPI 262 

I I IIIM lllllh:|||:|l Ml: hllllll : III IMMI I I 
Db 244 SGPGSIRKKRFVSSPRYVETMLVADQSMADFHGSGLKHYLLTLFSVAARFYKHPSIRNSI 303 

Qy 263 NIVVVKVLLLRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLC 322 

::||l|:|:: : IhM lllillllll llh I Ih lh:|IIIIIIIIMI 
Db 304 SLVWKILVIYEEQKGPEVTSNAALTLRNFCNWQKQHNSPSDRDPEHYDTAILFTRQDLC 363 

Qy 323 GATTCDTLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHDNVKVCEEVFG 3 82 

h lllllllllllhlll lllllllllll :||llllllllllllll|: I I : I 
Db 364 GSHTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKHCASLNG 423 

Qy 383 KLRANHMMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASY 442 

:|:|: I :| : Ml III ::| llhllhlhhl II II MM I 
Db 424 VTGDSHLMASMLSSLDHSQPWSPCSAYMVTSFLDNGHGECLMDKPQNPIKLPSDLPGTLY 483 

Qy 443 TLSQQCELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLK 501 

::||: II II II II IIIM : I : I I I h II II M I M II II I h 

Db 484 DANRQCQFTFGEESKHCPDAASTCTTLWCTGTSGGLLVCQTKHFPWADGTSCGEGKWCVS 543 

Qy 502 GACVERHNLNKH RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGV 558 

III : :: II I III I hi llllllllll hi II I MMIIM 

Db 544 GKCVNKTDM-KHFATPVHGSWGPWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGK 602 

Qy 559 RVKYRSCNLEPCPSSASGKSFREEQCEAFNGYNHSTNRLTLAVAWVPKYSGVSPRDKCKL 618 

IhlMlhl II : :|hlllMlli 1 :: :: II llhlllhhIM 

Db 603 RVRYRSCNIEDCPDN-NGKTFREEQCEAHNEFSKASFGNEPTVEWTPKYAGVSPKDRCKL 661 

Qy 619 ICRANGTGYFYVLAPKWDGTLCSPDSTSVCVQGKCIKAGCDGNLGSKKRFDKCGVCGGD 678 

I I I llhll lllllll lllllllllllhhillll : MhlMIMIIh 
Db 662 TCEAKGIGYFFVLQPKWDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGN 721 

Qy 679 NKSCKKVTGLFTKPMHGYNFWAIPAGASSIDIRQRGYKGLIGDDNYLALKNSQGKYLLN 7 38 

:Mh:h I Ih :| MMh:|::: I M = -Ih: = I hll 

Db 722 GSTCKKMSGIVTSTRPGYHDIVTIPAGATNIEVKHRNQRGSRNNGSFLAIRAADGTYILN 7 81 

Qy 739 GHFVVSAVERDLVVKGSLLRYSGTGTAVESLQASRPILEPLTVEVLSVGKMTPPRVRYSF 798 

hi :| :hl| lh:|Mlh hi ::: h lllhMI II h::::^ 
Db 782 GNFTLSTLEQDLTYKGTVLRYSGSSAALERIRSFSPLKEPLTIQVLMVGHALRPKIKFTY 841 

Qy 799 YLPKEPREDKSSHPKDPRGPSVLHNSVLSLSNQVEQPDDRPPARWVAGSWGPCSASCGSG 858 

h h: : I MM II :MM 

Db 842 FMKKKTES FNAIPTFS EWVIEEWGECSKTCGSG 874 

Qy 859 LQKRAVDCRGSAGQRTVPACDAAH--RPVETQACGE-PCPTWELSAWSPCSKSCGRGFQR 915 

hi III I II : I M h I : III h: IIMIhlhh:: 

Db 875 WQRRWQCRDINGH PASECAKEVKPASTRPCADLPCPHWQVGDWSPCSKTCGKGYKK 931 



Qy 916 RSLKCVGHGGRLLARDQCNLHRKPQE-LDFCVLRPC 950 

hllll I I :|: : h I I 

Db 932 RTLKCVSHDGGVLSNESCDPLKKPKHYIDFCTLTQC 967 



Search completed: August 5, 2006, 00:29:46 
Job time : 190 secs 



SCORE Search Results Details for Application 10763210 and Search Result us-10-763-210-1.rai. 

This page gives you Search Results detail for the Application 10763210 and Search Result us-10-763-210-1.rai. 


GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 






OM protein - protein search, using sw model 

Run on: August 5, 2006, 00:14:04 ; Search time 52 Seconds 
(without alignments) 
1599.117 Million cell updates/sec 

Title: US-10-763-210-1 
Perfect score: 5164 
Sequence: 1 MLLLGILTLAFAGRTAGGFE DQCNLHRKPQELDFCVLRPC 950 

Scoring table: BLOSUM62 
Gapop 10.0 , Gapext 0.5 

Searched: 650591 seqs, 87530628 residues 

Total number of hits satisfying chosen parameters: 650591 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 
1 : /EMC_Celerra_SIDS3/ptodata/2/iaa/5_COMB.pep: * 
2 : /EMC_Celerra_SIDS3/ptodata/2/iaa/6_COMB.pep: * 
3 : /EMC_Celerra_SIDS3/ptodata/2/iaa/7_COMB.pep: * 
4 : /EMC_Celerra_SIDS3/ptodata/2/iaa/H_COMB.pep: * 
5 : /EMC_Celerra_SIDS3/ptodata/2/iaa/PCTUS_COMB.pep: * 
6 : /EMC_Celerra_SIDS3/ptodata/2/iaa/RE_COMB.pep: * 
7 : /EMC_Celerra_SIDS3/ptodata/2/iaa/backfiles1.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
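
The throughput and "Query Match" figures in this report can be reproduced from the other header values. The Python sketch below assumes that one "cell update" is one cell of the Smith-Waterman matrix (query length times database residues) and that Query Match is the hit score as a percentage of the perfect score. Both are assumptions inferred from the printed numbers, and the function names are hypothetical.

```python
def cell_updates_per_sec(query_len, db_residues, seconds):
    """Dynamic-programming cells filled per second.

    Assumption: the search fills one cell for every
    (query residue, database residue) pair.
    """
    return query_len * db_residues / seconds


def query_match_pct(score, perfect_score):
    """Hit score expressed as a percentage of the query's self-score."""
    return 100.0 * score / perfect_score


# Header values for this search: 950-residue query,
# 87530628 database residues, search time 52 seconds.
rate = cell_updates_per_sec(950, 87_530_628, 52)
print(f"{rate / 1e6:.3f} Million cell updates/sec")

# Score 2482.5 against a perfect score of 5164.
print(f"Query Match {query_match_pct(2482.5, 5164):.1f}%")
```

Under these assumptions the computed rate rounds to the 1599.117 Million cell updates/sec printed in the header, and a score of 2482.5 corresponds to the 48.1% Query Match listed in the summaries.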



SUMMARIES 



                   %
Result           Query
   No.    Score  Match  Length  DB  ID                   Description

     1     5164  100.0     950   2  US-10-009-332-1      Sequence 1, Appli
     2   2482.5   48.1     950   2  US-09-321-987B-4     Sequence 4, Appli
     3   2480.5   48.0     949   2  US-09-568-559-2      Sequence 2, Appli
     4   2480.5   48.0     967   2  US-09-130-491-2      Sequence 2, Appli
     5     2274   44.0     727   2  US-09-445-023A-1     Sequence 1, Appli
     6     2273   44.0     727   2  US-09-445-023A-12    Sequence 12, Appl
     7   2135.5   41.4     890   2  US-09-949-002-394    Sequence 394, App
     8   2132.5   41.3     887   2  US-09-949-002-534    Sequence 534, App
     9     2113   40.9     905   2  US-09-369-364A-9     Sequence 9, Appli
    10     1917   37.1     837   2  US-09-122-126B-2     Sequence 2, Appli
    11     1917   37.1     837   2  US-09-634-286A-2     Sequence 2, Appli
    12     1917   37.1     837   2  US-10-247-685-2      Sequence 2, Appli
    13     1917   37.1     849   2  US-09-949-002-564    Sequence 564, App
    14     1916   37.1     837   2  US-10-012-231A-317   Sequence 317, App
    15     1916   37.1     837   2  US-10-015-389A-317   Sequence 317, App
    16     1916   37.1     837   2  US-10-006-768A-317   Sequence 317, App
    17     1916   37.1     837   2  US-10-015-671A-317   Sequence 317, App
    18     1916   37.1     837   2  US-10-015-393A-317   Sequence 317, App
    19     1916   37.1     837   2  US-10-011-833A-317   Sequence 317, App
    20     1916   37.1     837   2  US-10-006-041A-317   Sequence 317, App
    21     1916   37.1     837   2  US-10-012-064A-317   Sequence 317, App
    22     1916   37.1     837   2  US-10-015-392A-317   Sequence 317, App
    23     1916   37.1     837   3  US-10-011-795B-317   Sequence 317, App
    24     1916   37.1     837   3  US-10-015-386A-317   Sequence 317, App
    25     1916   37.1     837   3  US-10-012-121A-317   Sequence 317, App
    26     1916   37.1     837   3  US-10-006-485A-317   Sequence 317, App
    27     1916   37.1     837   3  US-10-006-746A-317   Sequence 317, App
    28     1916   37.1     837   3  US-10-012-752A-317   Sequence 317, App
    29     1916   37.1     837   3  US-10-017-253A-317   Sequence 317, App
    30     1916   37.1     837   3  US-10-015-519A-317   Sequence 317, App
    31     1916   37.1     837   3  US-10-015-715A-317   Sequence 317, App
    32     1916   37.1     837   3  US-10-007-236A-317   Sequence 317, App
    33     1915   37.1     837   2  US-09-949-002-380    Sequence 380, App
    34     1900   36.8     608   2  US-09-130-491-13     Sequence 13, Appl
    35     1854   35.9     930   2  US-09-369-364A-2     Sequence 2, Appli
    36   1848.5   35.8    1882   2  US-09-369-364A-13    Sequence 13, Appl
    37   1847.5   35.8     930   2  US-09-122-126B-15    Sequence 15, Appl
    38   1847.5   35.8     930   2  US-09-634-286A-15    Sequence 15, Appl
    39   1847.5   35.8     930   2  US-10-247-685-15     Sequence 15, Appl
    40   1798.5   34.8     874   2  US-09-369-364A-15    Sequence 15, Appl
    41     1607   31.1     551   2  US-09-130-491-16     Sequence 16, Appl
    42   1593.5   30.9    2150   2  US-09-321-987B-2     Sequence 2, Appli
    43     1585   30.7    2165   2  US-09-800-729-155    Sequence 155, App
    44     1304   25.3    1082   2  US-10-363-937-17     Sequence 17, Appl
    45     1302   25.2    1104   2  US-09-981-953A-4     Sequence 4, Appli



ALIGNMENTS 



RESULT 1 
US-10-009-332-1 

; Sequence 1, Application US/10009332 
; Patent No. 6716613 
; GENERAL INFORMATION: 

APPLICANT: Yamanouchi Pharmaceutical Co., Ltd. 
; APPLICANT: Kazusa DNA Research Institute 

TITLE OF INVENTION: NOVEL METALLOPROTEASE HAVING AGGRECANASE ACTIVITY 

FILE REFERENCE: Q67541 

CURRENT APPLICATION NUMBER: US/10/009,332 
; CURRENT FILING DATE: 2001-12-10 
; PRIOR APPLICATION NUMBER: JPA Hei 11-321740 
; PRIOR FILING DATE: 1999-11-11 

PRIOR APPLICATION NUMBER: JPA 2000-144020 

PRIOR FILING DATE: 2000-05-16 
; NUMBER OF SEQ ID NOS : 35 



This page gives you Search Results detail for the Application 10763210 and Search Result us-10-763-210-1.rapbn. 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: August 5, 2006, 00:26:50 ; Search time 36 Seconds 
(without alignments) 
1765.787 Million cell updates/sec 

Title: US-10-763-210-1 
Perfect score: 5164 
Sequence: 1 MLLLGILTLAFAGRTAGGFE DQCNLHRKPQELDFCVLRPC 950 

Scoring table: BLOSUM62 
Gapop 10.0 , Gapext 0.5 

Searched: 236815 seqs, 66914042 residues 

Total number of hits satisfying chosen parameters: 236815 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_AA_New: * 
1: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US09_NEW_PUB.pep: * 
2: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US06_NEW_PUB.pep: * 
3: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US07_NEW_PUB.pep: * 
4: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US08_NEW_PUB.pep: * 
5: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/PCT_NEW_PUB.pep: * 
6: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US10_NEW_PUB.pep: * 
7: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US11_NEW_PUB.pep: * 
8: /EMC_Celerra_SIDS3/ptodata/1/pubpaa/US60_NEW_PUB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



                   %
Result           Query
   No.    Score  Match  Length  DB  ID                   Description

     1     1916   37.1     837   6  US-10-196-749-352    Sequence 352, App
     2     1183   22.9    1201   7  US-11-300-928-40     Sequence 40, Appl
     3    551.5   10.7     525   6  US-10-196-749-436    Sequence 436, App
     4    349.5    6.8     813   6  US-10-196-749-466    Sequence 466, App
     5      317    6.1     737   7  US-11-193-353-4      Sequence 4, Appli
     6      314    6.1     735   6  US-10-196-749-88     Sequence 88, Appl
     7      314    6.1     909   7  US-11-226-554-80     Sequence 80, Appl
     8      314    6.1     909   7  US-11-248-718-80     Sequence 80, Appl
     9    276.5    5.4    5738   6  US-10-505-928-150    Sequence 150, App
    10      274    5.3     763   6  US-10-982-908-28     Sequence 28, Appl
    11      255    4.9     820   6  US-10-982-908-26     Sequence 26, Appl
    12    254.5    4.9    5635   6  US-10-766-760-2      Sequence 2, Appli
    13      237    4.6     832   6  US-10-505-928-491    Sequence 491, App
    14      214    4.1     734   7  US-11-238-282-19     Sequence 19, Appl
    15      214    4.1     734   7  US-11-300-928-11     Sequence 11, Appl
    16      173    3.4     765   7  US-11-165-586-36     Sequence 36, Appl
    17    171.5    3.3    1403   7  US-11-217-997-12     Sequence 12, Appl
    18    166.5    3.2     934   7  US-11-247-437-10     Sequence 10, Appl
    19    166.5    3.2     934   7  US-11-033-476-5      Sequence 5, Appli
    20      164    3.2     943   7  US-11-370-424-7      Sequence 7, Appli
    21      162    3.1     734   7  US-11-165-586-45     Sequence 45, Appl
    22      162    3.1    1398   7  US-11-217-997-4      Sequence 4, Appli
    23      162    3.1    1404   7  US-11-217-997-2      Sequence 2, Appli
    24      162    3.1    1547   7  US-11-217-997-22     Sequence 22, Appl
    25      162    3.1    1577   7  US-11-217-997-16     Sequence 16, Appl



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: August 5, 2006, 00:08:44 ; Search time 47 Seconds 
(without alignments) 
1944.807 Million cell updates/sec 

Title: US-10-763-210-1 
Perfect score: 5164 
Sequence: 1 MLLLGILTLAFAGRTAGGFE DQCNLHRKPQELDFCVLRPC 950 

Scoring table: BLOSUM62 
Gapop 10.0 , Gapext 0.5 

Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 283416 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : PIR_80: * 
pir1: * 
pir2: * 
pir3: * 
pir4: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



                   %
Result           Query
   No.    Score  Match  Length  DB  ID        Description

     1   2482.5   48.1     951   2  T00017    gene ADAMTS-1 prot
     2     1915   37.1     837   2  T00355    hypothetical prote
     3     1604   31.1     550   2  T47158    hypothetical prote
     4     1585   30.7    2165   2  T21371    hypothetical prote
     5     1192   23.1    1205   2  T18517    procollagen N-endo
     6      681   13.2    1558   2  C89114    protein C37C3.6a [
     7      681   13.2    2167   2  T34395    hypothetical prote
     8    559.5   10.8    1444   2  T18856    angiogenesis inhib
     9    510.5    9.9     860   2  T16892    hypothetical prote
    10      493    9.5     951   2  T00260    hypothetical prote
    11    450.5    8.7     957   2  T15976    hypothetical prote
    12    411.5    8.0    1059   2  T22545    hypothetical prote
    13    344.5    6.7     571   2  S24789    jararhagin C precu
    14      337    6.5     609   2  S55270    catrocollastatin p
    15      329    6.4     903   2  S60257    meltrin alpha - mo
    16    323.5    6.3     617   2  S48160    metalloproteinase
    17      319    6.2     826   2  A60385    monocyte surface a
    18      313    6.1     549   2  S48169    metalloproteinase
    19      308    6.0     789   2  S28259    androgen-regulated
    20      307    5.9     411   1  HYSNFA    fibrolase (EC 3.4.
    21      304    5.9     407   2  S66260    metalloproteinase
    22    300.5    5.8     616   2  A55796    ecarin precursor -
    23      297    5.8     480   1  A30065    trigramin precurso
    24      292    5.7     478   2  JC4880    fibrinolytic metal
    25    290.5    5.6     481   2  JC4342    fibrinolytic prote
    26      289    5.6     414   2  S41609    atrolysin C (EC 3.
    27    286.5    5.5     610   2  JC7530    vascular apoptosis
    28      284    5.5     414   1  HYRSAC    atrolysin C (EC 3.
    29    281.5    5.5     610   2  JC8056    halysase - Gloydiu
    30      280    5.4     481   2  S43125    trimucin precursor
    31    277.5    5.4     814   2  G02390    disintegrin-like m
    32    273.5    5.3     478   2  A43296    atrolysin E (EC 3.
    33      265    5.1     414   2  S41608    atrolysin B (EC 3.
    34      265    5.1     670   2  I65967    disintegrin-like m
    35      260    5.0     655   2  JC7850    disintegrin and me
    36      257    5.0     776   2  S28258    androgen-regulated
    37    256.5    5.0    1074   2  JC5928    semaphorin F precu
    38    255.5    4.9    1584   2  T00026    brain-specific ang
    39    253.5    4.9     952   2  T18900    disintegrin and me
    40      251    4.9    1042   2  T26644    hypothetical prote
    41    248.5    4.8    1170   1  TSHUP1    thrombospondin 1 p
    42    245.5    4.8    1170   2  A40558    thrombospondin 1 p
    43    244.5    4.7     419   2  A59414    metalloproteinase
    44      243    4.7     478   2  JQ1301    hemorrhagic protei
    45    241.5    4.7     484   2  JC8020    metalloproteinase-



ALIGNMENTS 



RESULT 1 
T00017 

gene ADAMTS-1 protein - mouse 

C;Species: Mus musculus (house mouse)
C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 15-Mar-2004
C;Accession: T00017
R;Kuno, K.; Iizasa, H.; Ohno, S.; Matsushima, K.
Genomics 46, 466-471, 1997
A;Title: The exon/intron organization and chromosomal mapping of the mouse
ADAMTS-1 gene encoding an ADAM family protein with TSP motifs.
A;Reference number: Z14055; MUID:98110583; PMID:9441751
A;Accession: T00017
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-951 <KUN>
A;Cross-references: UNIPARC:UPI0000049F23; EMBL:AB001735; NID:g2809056;
PIDN:BAA24501.1; PID:g2809057
A;Experimental source: strain 129SVJ
C;Genetics:
A;Gene: ADAMTS-1



>gi|12644483|sp|Q9UHI8|ATS1_HUMAN ADAMTS-1 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 1) (ADAM-TS 1) (ADAM-TS1) (METH-1)
Length=967

Score = 471 bits (1213), Expect = 1e-132, Method: Composition-based stats.
Identities = 245/387 (63%), Positives = 297/387 (76%), Gaps = 6/387 (1%) 


Query 1 RAKRFVSIPRYVETLVVADESMVKFHGADLEHylltllataarlyrHPSILNPINIvvvk 60

R KRFVS RYVET++VAD+SM +FHG+ L+HYLLTL + AARLY+HPSI N +++VVVK
Sbjct 249 RKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVVVK 308

Query 61 vllIRDRDSGPKVTGNAALTLRNFCAWQKKLNKVSDKHPEYWDTAILFTRQDLCGATTCD 120

+L++ D GP+VT NAALTLRNFC WQK+ N SD+ E++DTAILFTRQDLCG+ TCD 
Sbjct 309 ILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTCD 368 

Query 121 TLGMADVGTMCDPKRSCSVIEDDGLPSAFTTAHELGHVFNMPHD^A/^<VCEEVFGKLRANH 180 

TLGMADVGT+CDP RSCSVIEDDGL +AFTTAHELGHVFNMPHD+ KG + G + +H 
Sbjct 369 TLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNGVNQDSH 428

Query 181 MMSPTLIQIDRANPWSACSAAIITDFLDSGHGDCLLDQPSKPISLPEDLPGASYTLSQQC 240 

MM+ L +D + PWS CSA +IT FLD+GHG+CL+D+P PI LP DLPG SY ++QC 
Sbjct 429 MMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSYDANRQC 488 

Query 241 ELAFGVGSKPCP-YMQYCTKLWCTGKAKGQMVCQTRHFPWADGTSCGEGKLCLKGACVER 299 

+ FG SK CP C+ LWCTG + G +VCQT+HFPWADGTSCGEGK C+ G CV + 

Sbjct 489 QFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCINGKCVNK 548 

Query 300 HNLNKH---RVDGSWAKWDPYGPCSRTCGGGVQLARRQCTNPTPANGGKYCEGVRVKYRS 356

+ KH GSW W P+G CSRTCGGGVQ R+C NP P NGGKYCEG RV+YRS

Sbjct 549 TD-RKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYRS 607

Query 357 CNLEPCPSSASGKSFREEQCEAFNGYN 383

CNLE CP + +GK+FREEQCEA N + +
Sbjct 608 CNLEDCPDN-NGKTFREEQCEAHNEFS 633 






